Transparency of chemical risk assessment data under REACH

The REACH regulation requires EU manufacturers and importers of substances to register information on the hazard and risk of their substances with the European Chemicals Agency (ECHA). Risk management of the substances is based on the provided information. It is known that conclusions on hazard and risk are influenced by expert judgements as well as potential conflict of interests. Thus, it is important that hazard and risk assessments are transparent and can be evaluated by a third party. The aim of this study is to scrutinize the transparency, i.e. the accessibility and comprehensibility, of information on substances registered under REACH. Data on repeated dose toxicity and hazard assessment conclusions were extracted for 60 substances from the REACH registration database available on the ECHA website. The data were compiled in a database for systematically evaluating the transparency of information related to the conclusions on hazard or risk. In addition, chemical safety reports (CSR) were requested from ECHA for five substances. The transparency of information on the hazard and risk of substances was found to be limited for several reasons. First, certain information was removed due to confidentiality and certain fields were not published because they could contain confidential information although the information had not been claimed confidential. Also, the extent to which registrants reported information varied, and the presentation of some data and certain terminology required further clarification. In addition, the data source for the majority of the key and supporting studies could not be identified due to confidentiality. Since registrants are only required to summarise studies, it cannot be verified whether all relevant information from non-public industry reports have been reported. Lastly, certain information related to the hazard and risk assessment were only reported in the CSR which is only available upon request; a time-consuming and work-intensive process. As information on registered chemicals is currently provided to the public, it is difficult to follow steps that are undertaken in the hazard and risk assessment. This limits the possibility for a third party to evaluate the assessment.


Introduction
The European chemicals legislation REACH (regulation (EC) no 1907/2006 concerning Registration, Evaluation, Authorisation and restriction of CHemicals) requires industry to register and provide information on substances they produce or import into the EU at or above one tonne per year. 1 The information is compiled and submitted as a registration dossier to the European Chemicals Agency (ECHA) in the soware programme IUCLID. 2 One objective of REACH is to make information on the hazards and risks of chemicals available to the public 3 specied in REACH article 77(e) and recital 117.Information on substances becomes publicly available either by dissemination on the ECHA website through the dissemination portal 4 (referred to as the REACH registration database in this paper) or by ECHA granting access to certain documents upon request.Before information from the registration dossiers is published on the website, the dossiers undergo an automatic ltering step in which information from the dossier that is not intended to be published or considered condential is removed. 5The ltering step is based on articles 118 and 119 in REACH which specify information that is never to be published due to condentiality, information that is always considered public and information that is published unless claimed condential.
The REACH registration database provides a wealth of information on chemicals put on the European market.][8][9][10][11][12][13] In a study performed by the German Federal Institute for Risk Assessment (BfR), in which the availability of data for high tonnage volume substances were screened, over half of the registered substances were found to lack information required by REACH for one or more of the endpoints that were evaluated in the project. 7In a ve year report on REACH published in 2016, ECHA emphasised the insufficient data quality for "a signicant proportion" of the registration dossiers. 8he physicochemical and (eco)toxicological information on substances and the subsequent hazard and risk assessment submitted to ECHA by registrants form the basis for how chemicals covered by REACH are regulated and managed within the EU.0][21][22] Hence, it is important that underlying assumptions and reasoning in hazard and risk assessments as well as the data on which the assessments are based are explicitly stated and transparent to stakeholders, such as policy makers, academic researchers and NGOs. 23,24he aim of this study was to assess the transparency of information on chemical substances submitted by industry to ECHA under REACH focusing on disclosure and clarity of information.This was done by looking at (1) dissemination of information in the REACH registration database on the ECHA website for data registered on the endpoint repeated dose toxicity (RDT) as well as hazard assessment conclusions for a subset of 60 substances and (2) accessibility of chemical safety reports (CSR) that are available on request.A CSR is required for substances manufactured or imported at or above 10 tonnes per year and documents the chemical safety assessment.The overall purpose of the study was to contribute to improving accessibility and comprehensibility of hazard and risk assessment of chemicals since this enables other stakeholders to take part of how registrants come to their conclusion and evaluate the accuracy of the assessment.The accuracy of the gathered information was not investigated in this study.

Research questions and scope of study
The transparency of data published in the REACH registration database was examined by investigating (a) the possibility to identify and access the data source for studies on RDT that are submitted as part of the registration dossier, (b) the justications for omitting data on RDT from the REACH information requirements, known as "waiving" and (c) how hazard assessment conclusions were derived and reported.The study was limited to a subset of 60 substances (ESI Table 1 †) with a focus on data submitted for the endpoint RDT, which was a standard information requirement for all substances included in the study.

Data selection, collection and compilation
Of the 60 substances included in the study, 51 were selected from a list of 178 European Community (EC) numbers that were categorized depending on data availability.The list was provided by BfR, from their investigation of data availability of dossiers under REACH. 11Equal number of substances, i.e. 17, were randomly selected from each of three categories to have a selection of dossiers with varying degrees of reporting.Another nine substances were selected since they are subject to authorisation and/or restriction processes under REACH and are well investigated substances. 4ata up until the 1st of September 2015 were extracted for the 60 substances from the REACH registration database. 4For the majority of the substances, information was extracted from joint dossiers (registrants who manufacture or put the same substance on the market need to share certain information and make a joint submission of the dossier via an appointed lead registrant according to the principle "one substance, one registration", REACH art.11).For four of the substances, only data from individually submitted dossiers were available.The extracted information was compiled in a Microso Access database, designed for the purpose of this study, to enable a quantitative analysis of the information.
A request for access to full CSRs pursuant to regulation (EC) no 1049/2001 on public access to documents 25 was made electronically to ECHA on 11th of September 2014 for nine of the substances. 26The request was withdrawn aer correspondence with ECHA and a new request was made on 29th of September 2014. 27The new request included full CSRs for ve substances.

Type of data registered for repeated dose toxicity (RDT)
Data needed to full the information requirements under REACH are recorded in the IUCLID standard format "endpoint study record" (ESR) which consists of predened as well as free text elds. 28For the 60 substances, a total of 499 ESRs were reported for the endpoint RDT (Table 1).The majority of the ESRs (435) were summaries of experimental studies of which 362 ESRs were specied as "experimental study" and 48 as "read-across" (in read-across, the registrant uses information from a similar substance to predict properties or effects for the target substance to full the information requirements 29 ).For the remaining summaries of experimental studies (25 ESRs), no study result type was reported and thus labelled as "not speci-ed" in this study.Studies were reported as key, supporting, weight of evidence and disregarded depending on how they were used in the hazard assessment.For 32 ESRs, the use of the View Article Online study was not reported and therefore labelled as "not specied" in this study.In addition to the experimental data, "data waiving" was reported for 62 ESRs and for two substances an experimental study was planned (two ESRs).For 41 of the 60 substances, information was provided for all three major exposure routes: oral, inhalation and dermal (ESI Table 2 †).For the remaining 19 substances, information (i.e.experimental studies and data waiving) was lacking for one or two of the three exposure routes.
3.1.1Identication and accessibility of data source.In total, 523 citations of data sources were recorded for 433 ESR summaries of experimental data reported for RDT (Fig. 1).One substance had two ESRs in which no information on the data source was reported in either.In some ESRs, several references were cited.Since it was not possible to uniquely identify all data sources, the results from this study are reported as number of citations and ESRs (study summaries).
It was not possible to identify 328 out of 523 (63%) citations, since information on author, title and/or bibliographic source was missing or incomplete (Fig. 1).The majority of the citations, 284 out of the 328 (87%), that were not possible to identify referred to study reports and company data, i.e. data owned by the industry.The remaining 31 (9%) incomplete citations referred to publications.Data sources that were possible to identify mainly referred to publications.
One of the substances, diethylhexyl phthalate (DEHP; EC number 204-211-0), had a substantially higher number of ESRs (85) and data source citations (103) than the other substances included in the study (Fig. 1 and Table 1).Since the majority of the citations for DEHP, 70 out of 103 (68%), referred to publications (Fig. 1), it was possible to identify at least one of the references (if several references were cited for an ESR) for 65 out of 85 (76%) of the ESRs (Table 1).Thus, for DEHP, the reference could be identied and accessed for three out of the four ESRs Table 1 Data reported for the endpoint repeated dose toxicity for 60 substances.Data reported in endpoint study records (ESRs) in the REACH registration database up until 1st of September 2015 for the endpoint repeated dose toxicity for 60 substances (summary of ESI Table 1 †).The substance DEHP (EC number 204-211-0) is reported separately since a higher number of endpoint study records (ESR) was reported for DEHP than for the other substances (85 out of 435 ESR constituting 20% of all ESR).For experimental data (flagged as experimental data, read-across, weight of evidence and/or not specified), the accessibility of the data source refer to at least one of the data sources reported in the ESR (one ESR may refer to several data sources).Data sources that are always inaccessible and not possible to identify (i.e.information on author, title and bibliographic source are not disseminated) include industry data (study reports and company data).Some of the inaccessible data sources referred to literature that is publicly available (e.g.publications), but information to identify the data source was not provided, e.g.author, title and bibliographic source.The category "accessible data source" includes data sources for which author, title and/or bibliographic reference were possible to identify (e.g.publications and governmental risk assessments).For the remaining 59 substances included, it was not possible to identify the data source for 240 out of 350 (69%) ESRs (for reported study result types "experimental", "readacross" and "not specied" in Table 1).For key studies, the data source could not be identied for 118 out of 147 (80%) ESRs since the majority of these referred to study reports or company data.For 14 of the 118 (13%) key studies for which the data source could not be identied, referred to at least one public data source (ESI Table 1 †).Hence, it was only possible to identify a cited data source for 29 out of 147 (20%) key studies.Slightly more data sources were possible to identify for supporting studies: 45 out of 147 (31%) ESRs.

Study result type
Dissemination.The number of citations was not equal to the number of unique studies for several reasons.First, a data source may be cited more than once due to cross-reference to the same study.Second, a study can comprise results on many endpoints which can be recorded under the same or several endpoint sections, or include several experiments. 28Third, for industry reports that have been published, a full reference to the publication should be made in addition to the original study.Since, information on many of the data sources were incomplete, it was not possible to identify the number of unique studies.
The data source for a study summary needs to be reported in the ESR.However, the type of data source reported in the eld "reference type", e.g.study report, publication, and company report, inuences what information on the data source will be published on the website. 30The elds for reference type, year and report date are always disseminated for all study summaries regardless of the type of data source.However, author, title and bibliographic source are not automatically disseminated if (1) the IUPAC name of the substance, i.e. the name according to the system developed by the International Union of Pure and Applied Chemistry to unambiguously describe the structure of the substance, is claimed to be condential, (2) the ESR is claimed as condential except when the reference type refer to publication, or review article or handbook, (3) the reference type is study report or company data, or (4) if information is given in any of the following elds that are not disseminated on the website: "testing lab", "report number", "owner company" or "study number". 30,31Thus, the origin of study summaries based on industry data are not possible to identify.For the incomplete citations referring to publications, it cannot be conrmed whether the information on the data source was missing due to insufficient reporting on behalf of the registrant or due to ltering according to the dissemination rules.Since the elds "testing lab", "report number", "owner company" or "study number" are not disseminated, it cannot be veried whether any information had been provided in these elds and consequently ltered information on author, title and bibliographic source.
3.1.2Data waiving.For 37 of the 60 substances, registrants had waived information for one or more of the exposure routes (dermal, inhalation, oral or other route) amounting to 62 waiving ESRs (Table 1 and ESI Table 2 †).For each waiving ESR a statement of one of the following categories was provided: "study technically not feasible", "study scientically unjusti-ed", "exposure considerations" and "other justication".The most common waiving category was "other justication", 35 out of 62 waiving ESRs.No further information on data waiving was provided except for one substance which also, in addition to the data waiving category, justied the data waiving for two ESRs.Data waiving justications were found to be reported in the CSRs unless claimed condential and consequently redacted.
Dissemination.Data waiving is reported in IUCLID in two elds: "data waiving" and "data waiving justications".The eld "data waiving" is a pick list eld with four broad categories which is disseminated unless claimed condential. 31,32In addition, registrants are required to report a justication for waiving data.However, this eld is not automatically disseminated on the website. 31According to the lter rule applied to this eld, the information is either considered to undermine the commercial interests of the registrant or is not related to the hazard and safe use of the substance. 5Hence, data waiving justications are not accessible in the REACH registration database unless the registrant has reported the justication in another text eld in the ESR that is automatically disseminated.

Hazard assessment conclusions
Registrants need to assess the hazard for particular target groups (workers and the general population) through the major exposure routes (oral, dermal, inhalation and eyes) and for certain types of effects (acute, chronic, local, systemic) referred to as "exposure patterns". 33In total, 763 hazard assessment conclusions for various exposure patterns were reported for 57 of the 60 substances (ESI Table 4 †).For three of the substances, no hazard assessments were provided.The number of exposure patterns reported varied between substances and for some exposure patterns no hazard assessment conclusion was View Article Online reported.The hazard assessment conclusions were reported as broad, pre-dened categories, such as "DNEL" (derived noeffect level), "DMEL" (derived minimal effect level) "medium hazard" and "hazard unknown (no further information necessary)" (Table 2).In addition to the hazard assessment conclusion, information on the endpoint on which the assessment is based and the route of the original study could be provided.However, information in these elds was mostly provided for the DNEL conclusions only.
In total, 292 DNEL and 5 DMEL conclusions abbreviated as DN(M)ELs were recorded for 52 of the substances (Table 3).The derivation of the DN(M)EL conclusions could be described in further detail by providing the overall assessment factor as well as specic assessment factors applied with justications, and the dose descriptor starting point value, e.g. the no or lowest observed adverse effect level, NOAEL or LOAEL.The level of detail on the derivation of the DN(M)EL conclusions reported by the registrant varied between dossiers and also between exposure patterns within the dossier.For three DNEL and DMEL conclusions, no DN(M)EL value was reported in the value eld.An overall assessment factor was reported for 221 of the 297 (74%) DN(M)EL conclusions, and for 72 (24%) of the conclusions the assessment factor was further specied and justied.The content of free-text elds for justifying specic assessment factors also varied, ranging from one sentence to paragraphs with detailed description of reasoning and/or calculations.For 199 out of 297 (67%) of the DN(M)EL conclusions, the type of dose descriptor starting point (e.g.NOAEL) was given but only 64 out of 297 (22%) specied a value for the dose descriptor starting point.The endpoint on which the hazard assessment was based was reported for 90% of the DN(M)ELs.
Generally, more comprehensive information on the justications and discussions of the hazard assessment conclusions were provided in the CSRs in which also the study on which the conclusions was based was referred to, if reported by the registrant.
For three substances (EC number 201-553-2, 201-557-4 and 271-094-0), two summaries of toxicological information, labelled "toxicological information 1" and "2" respectively, were provided on the website with sometimes differing hazard assessment conclusions for the same exposure pattern (ESI Table 4 †).For one of the substances, one of the summaries was labelled "JS Member".The label, although not explained on the Table 2 Information reported for the hazard assessment in the REACH registration database.Reported information for hazard assessment conclusions up until 1st of September 2015 in the REACH registration database for 57 out of 60 substances (summary of ESI Table 4 †).In total, 763 exposure patterns (exposure route/exposure group/type of effect/exposure duration) were reported.For three of the 60 substances, no toxicological summary was provided.website, means that the registrant has made an opt-out from the lead dossier.
3.2.1 Dissemination.Registrants are required to derive DNELs and DMELs for relevant exposure patterns and should justify why a certain exposure route and target group is considered as relevant and other are not. 34However, the free text elds for providing the justications in IUCLID, "justication and comments" and "discussion", are not automatically disseminated on the website. 31Hence, the hazard assessment conclusions can only be understood from the broad pre-dened categories.For DN(M)EL conclusions, the DN(M)EL value cannot be explicitly linked to the study summary from which the DN(M)EL is derived based on the information disseminated on the website.It may be linked if the registrant has reported information on the endpoint on which the assessment is based, the exposure route of the original study, and dose descriptor starting point and by performing back calculations using the DNEL value and the overall assessment factor.However, even when this information is provided, the process of identifying the study still involve some uncertainty.In IUCLID, registrants need to summarise and highlight key study(ies) for each toxicological endpoint on which the hazard assessment conclusions are based in the endpoint summary.The endpoint summary has previously not been disseminated 31 but will be published from June 2016 for new and updated dossiers. 35The dossiers for the substances included in this study had as of 5th of September 2016 not been updated and thus no study summary was disseminated on the website.The summaries will be re-processed for all dossiers aer 2018. 30

Access to CSRs
The rst request for CSRs comprised nine substances.This request was withdrawn aer consultation with ECHA since it proved difficult for ECHA to disclose the documents within the legislative time frame due to the length of the documents and the obligation to consult with the registrants. 26The regulation (EC) no 1049/2001, which regulates the accessibility of documents held by EU institutions to the public, sets a time frame of 15 working days for processing requests, by either granting access to the documents or informing the applicant why the request is being partially or totally refused, art.7(1). 25In cases of extensive requests (several or very long documents) institutions may negotiate the requests to seek a "fair solution", art.6(3), as well as extend the time limit with an additional 15 days, art.7(3).Consequently, the second request was restricted to ve substances (Table 4). 27ECHA identied 28 CSRs for the ve substances consisting of 100 pages each on average.In order to shorten the time frame for processing the request and limiting the administrative work required, the scope of the request was rened to CSRs provided by the lead registrants (in total ve CSRs).Nevertheless, it was foreseen by ECHA that the request could not be processed within the time limit due to the expected work load.Therefore, ECHA proposed to disclose the ve CSRs within 60 days.
Access to non-condential versions of the ve CSRs was provided aer 57 working days.The CSRs were electronically disclosed with a letter that explained certain redactions of the documents due to condentiality (Table 4). 27Information in the CSR that was approved as condential had been redacted manually with black boxes.Information on authors of unpublished studies were blanked out in all documents.In one or several of the documents, information related to the composition of the substance, tonnage production, manufacturing process and details in specic endpoint sections as well as exposure scenarios regarding manufacture was redacted.

Discussion
ECHA recognises transparency as one of the Agency's core values and is committed to continuously enhance transparency by improving dissemination of information on substances. 36,37s a result, ECHA launched in June 2016 new dissemination pages to present information on substances in the REACH registration database in a simpler and more accessible way to improve the usability of the information, particularly for non-scientic users. 38,39One of the new changes in the dissemination pages addresses one of the aspects highlighted in this study.However, despite this change it remains difficult to follow certain steps that registrants undertake in assessing the hazard and risk of substances as information is currently  provided to the public.The transparency of the hazard and risk assessment of substances under REACH was shown to be inuenced by (1) condentiality, (2) reporting by registrants, (3) lack of clarity of disseminated information, (4) verication of information and (5) accessibility to information.

Condentiality
ECHA states in their approach to transparency that it needs to be balanced with other aspects, such as the laws protecting condential business information, intellectual property rights and personal data. 37However, condentiality restricts the access to information that is crucial for understanding the conclusions drawn in the hazard and risk assessment.In one of the CSRs the discussion of the results of some studies as well as data waiving justications were claimed condential and thus the text was partially or fully blanked out.This means that it is only ECHA and member states competent authorities that have full access to the information and thus can evaluate and scrutinise justications and conclusions provided in the assessments.
Condentiality also restricts the possibility to identify the data source of the studies used in the hazard and risk assessment.For the majority of the key studies included in this study, it was not possible to identify the data source since most of the key studies were industry studies for which only information on reference type (i.e.study report) and year of publication was disseminated.Details on the data source of industry studies are not disseminated due to protection of personal data (names of authors, affiliation/and or laboratory) and commercial interests (the title may contain the IUCLID name which can be condential).In general, industry studies are used and preferred over other types of data when conducting regulatory risk assessment since they are performed according to standardised test methods and GLP and therefore considered to be reliable. 40hus, if the results from this investigation are representative, then much of the in vivo toxicological data used as key and ESI † in the hazard and risk assessments under REACH are not possible to identify and scrutinize.
The lter rules that are applied to dossiers before they are published on the website also remove information that may be condential without taking into consideration whether the information in fact is condential.For example, the justication elds for data waiving and hazard assessment conclusions are not automatically disseminated on the website 31 because the information is either considered to undermine the commercial interests of the registrant or not related to the hazard and safe use of the substance according to the lter rule. 5ECHA stated, in a response to an enquiry made by the organisation Cli-entEarth, that data waiving justication elds were not disseminated since they could contain condential information. 9Since the hazard assessment conclusion justication is related to the hazard and safe use of the substance, it is likely that this eld is not disseminated for the same reason as for data waiving justications although these justications are reported in the CSRs.Since the CSRs are manually reviewed, only information that has been claimed and approved as condential during the consultation with the registrant is blanked out.Thus, information that is not condential but important for understanding the steps in the hazard and risk assessment are not accessible to the public through the ECHA website but only by requesting access to the CSRs.
This lter rule has until recently also applied to the endpoint summaries which links the DN(M)EL derivation to the toxicological study on which the calculation is based.So far, the study can only be indirectly identied through other elds describing the derivation of the DN(M)EL.The difficulty to identify the study on which the DN(M)EL is based has also been highlighted in previous studies. 12,18However, from June 2016, endpoint summaries will be disseminated for new and updated dossiers as part of ECHA's work on improving transparency.This will hopefully enable the identication of the study used for deriving the DN(M)EL.Nevertheless, what elds will be disseminated from the endpoint summaries still remain to be seen.However, the elds for DNEL justications and conclusions will as previously not be published 30 and access to these elds is also important to fully understand the assessment made.
Although condentiality may be warranted in certain situations, any non-disclosure of information needs to be weighed against the citizen's right to know, the consequences of keeping information condential and thus the benets of transparency to the society as a whole.What is considered to be condential has varied over time as well as between legislations and Member States and therefore is subject to renegotiation.In cases where transparency is limited due to condentiality, it is important that the condentiality claims can be scrutinised.On the new dissemination pages, ECHA indicates what information has been approved as condential and consequently is removed.Transparency would be further enhanced if the justication for claiming the information as condential as well as the ECHA decision approving the claim also were to be published just like other ECHA decisions are published on their website.

Reporting of information
Information can only be disseminated as long as it has been reported by the registrant.The extent of reporting varied between dossiers as well as within certain sections of the dossier.For three substances, no hazard assessment conclusions were reported although required for substances produced at or above 10 tonnes per year.The number of hazard assessment conclusions reported as well as the information provided for DN(M)ELs (e.g.assessment factors and justications) also varied.For some substances, the registrants did not specify how the information was used in the assessment, i.e. whether studies were used as key or supporting studies or altogether disregarded, and certain bibliographic references were only partly reported.Incomplete reporting inevitably affects the possibility to understand and evaluate the hazard and risk assessment made by the registrant and thus is a prerequisite for transparency.In certain cases, it was difficult to assess whether information was missing due to incomplete reporting or due to the lter rules applied in the dissemination process.
The problem with reporting has been brought up in an appeal before the Board of Appeal (BoA).A registrant contested ECHA's decision to consider another registrant's registration dossier as complete although it lacked basic data on physicochemical and toxicological data according to the standard information requirements. 41Since the automated IT system in the completeness check only veries that information has been provided in the elds and the subsequent compliance check only covers a minimum of ve percent of the dossiers registered in each tonnage band, this means that dossiers can be approved without fullling all legal requirements.The BoA noted in their decision that the current completeness check with the automated IT system is not sufficient to full ECHA's obligation to verify that dossiers are complete. 41ECHA has announced that they will from now on manually verify the completeness of dossiers in addition to the automated completeness check. 8

Clarity
An important aspect of transparency relates to the clarity and comprehensibility of the published information. 42In the new dissemination pages, ECHA has worked on presenting information on substances in a simpler and more comprehensible way particularly for non-scientic users in an "info card" and "brief prole".However, the raw data from the dossiers are still partly displayed in a way that makes it difficult to follow the reasoning in the hazard and risk assessment and understand the specics.The data waiving and hazard assessment conclusions were provided as broad pre-dened categories of which most of the categories are not self-explanatory and sometimes difficult to nd the explanation for in the ECHA guidance documents.Clarity would be improved by providing pop-up information windows on the dissemination pages for the raw data source for explaining the meaning of various elds as well as where in the legislation and the guidance more information can be found.Such pop-up windows are currently provided in the info card and brief prole.Although, explaining the application of broad categories will not provide the reason as to why a certain category is applicable in each particular case.This can only be resolved by disseminating the actual justication which is currently removed from the website for condentiality reasons during the dissemination process.
ECHA's decision to publish endpoint summaries will hopefully clarify why no data were submitted for some of the exposure routes and why certain exposure patterns were not considered to be relevant for the assessment.However, it will not explain the multiple summaries of toxicological information with sometimes different hazard assessment conclusions for the same exposure pattern which were published for three of the substances included in the study.

Verication of information
Accurate reporting of physicochemical properties and results from toxicity studies is important in order to assess the hazards and risks of a substance and the possibility to verify data is central to instill stakeholder trust.In REACH art. 3 (29), registrants are not required to provide full study reports, but rather a summary of the study.Thus, information in summaries based on industry reports 42 cannot be veried since such studies are intellectual property rights of the company.Moreover, sufficient information to enable an independent assessment of the study is only required for robust study summaries such as key studies, REACH art. 3 (28).Consequently, the system places high trust in data that cannot be veried and could potentially be biased due to conict of interest.In a study which looked at the registration dossiers of ve endocrine disrupting substances, ClientEarth found that important information in the original studies had not been included in the study summary. 9Although ECHA can request information required under REACH, art.36, it would be resource and time-consuming for the agency to validate and verify all submitted data.

Accessibility to information
The disclosure of information on substances through the REACH registration database greatly increases the availability and accessibility of chemical data to the public.However, some information from the registration dossier is not accessible via the website, but only through the CSR.This includes information that is not condential but ltered in the dissemination process, as well as the exposure assessment (although information on exposure scenarios will be published from June 2016) and risk characterisation required for substances fullling certain hazard categories under REACH art.14(4).
The current system for retrieving CSRs from ECHA is workintensive and time-consuming which restricts the amount of information that the public can request and how readily it can be provided.Therefore, it is advised to nd a solution that will provide the same information in a less work-intensive and timely manner, for example by requiring registrants to submit a non-condential version of the CSR.

Conclusion
One of the objectives of REACH is to increase transparency by making information on substances accessible to the public, which is realised by disseminating information on the ECHA website.Thus, basic information on the hazards of the substances and common uses is summarised in info cards and brief proles in an accessible way to the general public.However, it is also of interest to know how registrants reach their conclusions in the hazard and risk assessments.As shown in this study, the information underlying conclusions on hazards and risks for substances registered under REACH is accessible to the public in a semi-transparent way that makes it difficult for a third party to fully evaluate the registrant's assessment.First, the protection of commercial interests by keeping information condential severely limits the accessibility to important information.Furthermore, non-condential information is removed in the automatic ltering step in the dissemination process of the dossiers as a precaution because they may contain condential information.We suggest that ECHA review the ltering step so that only information that is truly claimed or always considered to be condential is removed.Second, poor reporting by registrants affects the availability of data.Improving the completeness check of dossiers will be an important step towards ensuring that the required data is also provided.Third, the information provided in the dossier is not always clear and could be further explained to improve the comprehensibility of the information.Fourth, the reference for the majority of the studies could not be identied since they referred to industry data that are intellectual property of the company.For this reason, it is not possible to verify if all important information to evaluate the study has been provided since the registrants are only required to submit a summary of the studies.It would be preferable if all data on which assessments were based were available for public scrutiny and evaluation.Fih, the process of requesting CSRs that contain information that is currently not disseminated on the ECHA website is work-intensive and time-consuming.Thus, it cannot be guaranteed that information that in theory is non-condential can be accessed.Publishing a non-condential version of the CSR on the website would make more information on the hazard and risk assessment of the substances easily accessible to the public.Since it is known that hazard and risk assessments are inuenced by expert judgments and conict of interests, it is important to enable other stakeholders to evaluate the scientic robustness of the assessment.Some of the recommended/proposed changes require a change in the regulation, others fall under ECHA's mandate to change.

Fig. 1
Fig. 1 Data source citations.For 60 substances, a total of 523 citations of data sources were recorded of which 63% were not possible to identify, i.e. title, author and/or bibliographic source were not provided (ESI Table 3 †).
Reported categories in the eld " Hazard Assessment Conclusions" (HAC) Total number of HAC reported Reporting of the eld "Most sensitive endpoint" Reporting of the eld "Route of original study" Total number of exposure patterns 763 309 82 1512 | Environ.Sci.: Processes Impacts, 2016, 18, 1508-1518 This journal is © The Royal Society of Chemistry 2016 Environmental Science: Processes & Impacts Paper Requested chemical safety reports (CSRs) from ECHA.Type of information blanked for confidentiality reasons in the five non-confidential CSRs requested from ECHA 29th of September 2014 and accessed 22nd of December 2014.Redactions of CSRs EC number Information blanked out (for one or several of the substances) purity, composition of the substance, constituents, impurities and additives and their concentration range respectively.information through which the precise tonnage can be back-calculated (e.g.PEC values and risk characterisation ratios).Protection of commercial interests (article 4(2) rst indent of regulation (EC) No 1049/2001 and article 118(2) of the REACH regulationrst indent of regulation (EC) no 1049/2001).Certain information in specic endpoint sections x The discussion of the results of some of the studies.Protection of commercial interests including intellectual property (article 4(2) rst indent of regulation (EC) no 1049/2001).Certain information related to the exposure scenario for manufacture x Information on the production conditions and safety measures (e.g.product characteristics, frequency and duration of use, conditions and measures related to the evaluation of personal protection, hygiene and health). of unpublished studies and other details (e.g.names of testing laboratories and owner companies).Protection of personal data (article 4(1)(b) of regulation (EC) no 1049/2001) or protection of commercial interest (article 4(2) rst indent of regulation (EC) no 1049/2001).

Table 3
Information reported for DNEL and DMEL hazard assessment conclusions in the REACH registration database.Reported IUCLID fields in the REACH registration database up until 1st of September 2015 in the toxicological summary "toxicological information" for the hazard assessment conclusions DNEL and DMEL for 52 out of 60 substances (summary of ESI Table 5 †).For eight of the 60 substances, no DNEL or DMEL hazard assessment conclusions were reported.
Total number of ESR with DNEL and DMEL hazard assessment conclusions 297 a For example NOAEL.This journal is © The Royal Society of Chemistry 2016 Environ.Sci.: Processes Impacts, 2016, 18, 1508-1518 | 1513 Paper Environmental Science: Processes & Impacts Open Access Article.Published on 07 November 2016.Downloaded on 10/31/2019 12:40:51 AM.This article is licensed under a Creative Commons Attribution 3.0 Unported Licence.