The Chem4Word Project began in 2008 as a collaboration between Microsoft Research and the University of Cambridge. It was designed to make it easier to insert and modify chemical information (labels, formulas, 2-D depictions, etc.) from within Microsoft Office Word, and to store and manipulate that information in a semantically rich manner.

All of the data is stored as Chemical Markup Language and can be extracted using purely open-source tools.
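
As a rough illustration of that extraction, the sketch below uses only the Python standard library. It rests on assumptions that are not stated above: that the document is a .docx file (a ZIP archive), that the chemistry is kept in custom XML parts under `customXml/`, and that Chemical Markup Language (CML) content can be recognised by a root element in the namespace `http://www.xml-cml.org/schema`. The function name `extract_cml_parts` is hypothetical and not part of Chem4Word.

```python
import zipfile
import xml.etree.ElementTree as ET

# Assumed namespace for CML documents; adjust if your files differ.
CML_NS = "http://www.xml-cml.org/schema"


def extract_cml_parts(docx_file):
    """Return {part_name: raw_xml_bytes} for each custom XML part that looks like CML.

    `docx_file` may be a path or a binary file-like object.
    Assumes CML lives in parts named customXml/*.xml (an assumption, see above).
    """
    found = {}
    with zipfile.ZipFile(docx_file) as archive:
        for name in archive.namelist():
            # Only look at XML parts in the custom XML folder.
            if not (name.startswith("customXml/") and name.endswith(".xml")):
                continue
            data = archive.read(name)
            try:
                root = ET.fromstring(data)
            except ET.ParseError:
                continue  # skip parts that are not well-formed XML
            # ElementTree writes namespaced tags as "{namespace}localname".
            if root.tag.startswith("{" + CML_NS + "}"):
                found[name] = data
    return found
```

Calling `extract_cml_parts("report.docx")` on a document of your own (the file name here is only a placeholder) would return the raw XML of each matching part, which can then be passed to any CML-aware toolkit.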

