Crystallography Open Database
Email : email@example.com
Dear IUCr Executive Committee Members,
In August 2003, the Advisory Board of the Crystallography Open Database (COD) will request permission for unlimited access to the IUCr CIFs. The CIFS are presently available by free access at the IUCr journals web server. With this letter, the COD Advisory Board Members wish to explain their motivations, and why and how they believe that this new global database will become an invaluable tool for the International Community of Crystallographers.
Our IUCr President writes in the Newsletter 10-4 (2002): "Emerging nations can benefit from the use of the powerful techniques of X-ray crystallography in order to analyze, understand and use the unique natural resources within their countries whether mineralogical, chemical, or biological in nature". As stated in the first letter sent by the COD Advisory Board Members to Bill Duax (April 2003), this generous wish could more easily be achieved through free access to a global crystallography open database (including all inorganic, metallic, organometallic and organic compounds) which is the purpose of our new COD web server located at http://www.crystallography.net/ offering atomic coordinates in the IUCr CIF files format, together with a search engine.
The initial position of COD was to develop the database through individual or laboratory-based contributions. Rather than systematically copying the atomic coordinates from the literature, individual authors and laboratories are encouraged to send their own CIFs. Thanks to a few major donators (the American Mineralogist Crystal Structure Database for >4000 entries, the Institut de Physique de la Matière Condensée -Grenoble- for >1200 entries, the CRISMAT -Caen- for >850 entries and the Laboratoire des Fluorures -Le Mans- for >450 entries), as well as to many individual contributors, the COD already contains more than 12.000 entries. This is, however, only 3% of the 400.000 known published structures. Permission from IUCr would more than double the current COD content.
Currently, a search in COD is completed by two powerful options:
We will ask the IUCr for one of two options:
There are some data quality issues that need to be considered. Currently, COD expects a donor to upload to the COD only his best data (CIFs as built by the refinement software, and thus free of typos at least on atomic coordinates and cell parameters). We assume that the donor of the data does the quality check. The COD is confident that Crystallographers will demonstrate a high degree of professionalism and autonomy.
About durability of the COD, the current ten Advisory Board Members offer some guarantee. The open nature of the COD, which can be downloaded either in part or completely, including all data and source code, is also an argument favouring its survival. The whole COD system is based on open source software (Apache/MySQL/PHP) and can be used by interested parties (contact firstname.lastname@example.org). It is simple to install and would be quite useful to laboratories wanting to build a crystal database from their own crystal structure determinations, for either an intranet or an internet external access (or for building any crystal subset of interest to specialized Crystallographers such as zeolites, phosphates, fluorides, nitrates, carbonates, sulphates, etc). Collecting all such laboratory fragments in the global COD would offer an invaluable service to the world crystallographic community and particularly in emerging nations. The CRYSTMET, ICSD and CSD databases offer much more complete and powerful services than COD, which is limited to the minimum information allowing a crystallographer to survive at low cost. Even crystallography journals (not only IUCr journals) may decide in the near future to open their own database of published crystal structures in a searchable way. The journals interest is to sell articles, so that it seems clear that they may decide to open a database limited to REF files, the client wanting atomic coordinates would have to buy a copy or the published paper. REFs may include compounds in the COD that were not completely structurally characterized (no atomic coordinates, but with chemical formula, cell parameters and space group proposed by isotypy with a fully characterized analogue, etc). In this way, the COD would even be more complete than databases distributed for fees.
With this letter, we have presented to you a vision of free and easy access to the crystallographic data via the Internet. Once the database has developed to some maturity, it is our dream that its use will promote the science of crystallography at all levels of education and to all parts of the world. We envision the use of software, intimately associated with CIFs that will promote and ease the interface to understanding our exciting profession. We hope that as members of the IUCr executive committee, you can share our vision and grant COD access to the data files published in the IUCr journals. For more details about the COD, REF files and so on, visit the web site: http://www.crystallography.net/
The COD Advisory Board :
M. Berndt (Germany), D. Chateigner
(France), X.L. Chen (China), M. Ciriotti (Italy), L.M.D. Cranswick (Canada),
R.T. Downs (USA), A. Le Bail (France), L. Lutterotti (Italy), H. Rajan
(USA), A.F.T. Yokochi (USA).