<?xml version='1.0' encoding='UTF-8'?><?xml-stylesheet href="http://www.blogger.com/styles/atom.css" type="text/css"?><feed xmlns='http://www.w3.org/2005/Atom' xmlns:openSearch='http://a9.com/-/spec/opensearchrss/1.0/' xmlns:georss='http://www.georss.org/georss' xmlns:gd='http://schemas.google.com/g/2005' xmlns:thr='http://purl.org/syndication/thread/1.0'><id>tag:blogger.com,1999:blog-8942442068905528713</id><updated>2011-11-17T09:16:01.659+01:00</updated><category term='CDK'/><category term='Python'/><category term='chembl'/><category term='clustering'/><category term='Tanimoto'/><category term='chemoinformatics'/><category term='fragmentation'/><category term='rcdk'/><category term='fingerprints'/><category term='virtual screening'/><category term='descriptors'/><category term='MMsINC'/><category term='database'/><category term='substructure search'/><title type='text'>alchemoinformatics</title><subtitle type='html'></subtitle><link rel='http://schemas.google.com/g/2005#feed' type='application/atom+xml' href='http://alchemoinformatics.blogspot.com/feeds/posts/default'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8942442068905528713/posts/default?max-results=100'/><link rel='alternate' type='text/html' href='http://alchemoinformatics.blogspot.com/'/><link rel='hub' href='http://pubsubhubbub.appspot.com/'/><author><name>Matteo Floris</name><uri>http://www.blogger.com/profile/13177555699430611910</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='22' height='32' src='http://3.bp.blogspot.com/_eK36skX7Mfk/TBDSbdJfQ4I/AAAAAAAAACw/7kCnO6TT658/S220/matteo.jpg'/></author><generator version='7.00' uri='http://www.blogger.com'>Blogger</generator><openSearch:totalResults>29</openSearch:totalResults><openSearch:startIndex>1</openSearch:startIndex><openSearch:itemsPerPage>100</openSearch:itemsPerPage><entry><id>tag:blogger.com,1999:blog-8942442068905528713.post-3071715472838256402</id><published>2011-05-28T19:37:00.000+02:00</published><updated>2011-05-28T19:37:43.728+02:00</updated><title type='text'>pepMMsMIMIC paper is out</title><content type='html'>The &lt;a href="http://mms.dsfarm.unipd.it/pepMMsMIMIC/"&gt;pepMMsMIMIC&lt;/a&gt; paper now can be accessed from the &lt;a href="http://nar.oxfordjournals.org/content/early/2011/05/27/nar.gkr287.abstract"&gt;Nucleic Acid Research website&lt;/a&gt;.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8942442068905528713-3071715472838256402?l=alchemoinformatics.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://alchemoinformatics.blogspot.com/feeds/3071715472838256402/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://alchemoinformatics.blogspot.com/2011/05/pepmmsmimic-paper-is-out.html#comment-form' title='1 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8942442068905528713/posts/default/3071715472838256402'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8942442068905528713/posts/default/3071715472838256402'/><link rel='alternate' type='text/html' href='http://alchemoinformatics.blogspot.com/2011/05/pepmmsmimic-paper-is-out.html' title='pepMMsMIMIC paper is out'/><author><name>Matteo Floris</name><uri>http://www.blogger.com/profile/13177555699430611910</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='22' height='32' src='http://3.bp.blogspot.com/_eK36skX7Mfk/TBDSbdJfQ4I/AAAAAAAAACw/7kCnO6TT658/S220/matteo.jpg'/></author><thr:total>1</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8942442068905528713.post-5576626019684576898</id><published>2011-04-26T10:08:00.000+02:00</published><updated>2011-04-26T10:08:52.851+02:00</updated><title type='text'>A novel web-oriented peptidomimetic compound virtual screening tool.</title><content type='html'>&lt;div style="margin-bottom: 0cm; text-indent: 0.5cm;"&gt;&lt;span style="font-size: small;"&gt;&lt;span lang="en-GB"&gt;&lt;a href="http://mms.dsfarm.unipd.it/pepMMsMIMIC/"&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;span class="Apple-style-span" style="color: white;"&gt;pepMMsMIMIC&lt;/span&gt;&lt;/span&gt;&lt;/a&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;span class="Apple-style-span" style="color: white;"&gt; is a public, web-based virtual screening platform with the aim to suggest chemical &lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;span class="Apple-style-span" style="color: white;"&gt;compounds whose essential elements (pharmacophore) mimic a natural peptide or protein in 3D space  which hopefully retain the ability to interact with the biological target and produce the typical biological effect.&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div align="LEFT" class="body-text1" style="text-indent: 0.5cm;"&gt;&lt;span style="font-size: small;"&gt;&lt;span style="font-weight: normal;"&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;span class="Apple-style-span" style="color: white;"&gt;Starting from the 3D structure of any protein-protein/peptide complex,&amp;nbsp;&lt;/span&gt;&lt;/span&gt;&lt;a href="http://mms.dsfarm.unipd.it/pepMMsMIMIC/"&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;span class="Apple-style-span" style="color: white;"&gt;pepMMsMIMIC&lt;/span&gt;&lt;/span&gt;&lt;/a&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;span class="Apple-style-span" style="color: white;"&gt;&amp;nbsp;design process begins by identifying the key residues that are responsible for the protein-protein recognition process. In this process, the peptide complexity is reduced and the basic pharmacophore model is defined by its critical structural features (&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;i&gt;&lt;span style="font-weight: normal;"&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;span class="Apple-style-span" style="color: white;"&gt;peptide &lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/i&gt;&lt;i&gt;&lt;span style="font-weight: normal;"&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;span class="Apple-style-span" style="color: white;"&gt;annotation points&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/i&gt;&lt;span style="font-weight: normal;"&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;span class="Apple-style-span" style="color: white;"&gt;) in 3D space.&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div align="LEFT" class="body-text1" style="text-indent: 0.5cm;"&gt;&lt;span style="font-size: small;"&gt;&lt;span style="font-weight: normal;"&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;span class="Apple-style-span" style="color: white;"&gt;The&amp;nbsp;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;a href="http://mms.dsfarm.unipd.it/pepMMsMIMIC/"&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;span class="Apple-style-span" style="color: white;"&gt;pepMMsMIMIC&lt;/span&gt;&lt;/span&gt;&lt;/a&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;span class="Apple-style-span" style="color: white;"&gt;&amp;nbsp;paper has been accepted for publication in the NAR Web Server Issue 2011. I will post the Advance Access link in the near future.&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div align="LEFT" class="body-text1" style="text-indent: 0.5cm;"&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;span class="Apple-style-span" style="color: white;"&gt;Here the abstract:&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;div align="LEFT" class="body-text1" style="text-indent: 0.5cm;"&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;span class="Apple-style-span" style="color: white;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;blockquote&gt;&lt;div class="abstract-text-western" style="text-indent: 0cm;"&gt;&lt;span style="font-size: small;"&gt;&lt;span lang="en-GB"&gt;&lt;span style="font-weight: normal;"&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;span class="Apple-style-span" style="color: white;"&gt;pepMMsMIMIC is a novel web-oriented &lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;span style="font-size: small;"&gt;&lt;span lang="en-GB"&gt;&lt;span style="font-weight: normal;"&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;span class="Apple-style-span" style="color: white;"&gt;peptidomimetic compound&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;span style="font-size: small;"&gt;&lt;span lang="en-GB"&gt;&lt;span style="font-weight: normal;"&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;span class="Apple-style-span" style="color: white;"&gt; virtual screening tool based on a multi-conformers 3D- similarity search strategy. Key to the development of pepMMsMIMIC has been the creation of a library of 17 million conformers calculated from 3.9 million  commercially available chemicals collected in the MMsINC&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;sup&gt;&lt;span style="font-size: small;"&gt;&lt;span lang="en-GB"&gt;&lt;span style="font-weight: normal;"&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;span class="Apple-style-span" style="color: white;"&gt;®&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/sup&gt;&lt;span style="font-size: small;"&gt;&lt;span lang="en-GB"&gt;&lt;span style="font-weight: normal;"&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;span class="Apple-style-span" style="color: white;"&gt; database. Using as input the three-dimensional structure of a peptide bound to a protein, pepMMsMIMIC suggests  which chemical structures are able to mimic the protein-protein recognition of this natural peptide using both pharmacophore and shape similarity techniques. We hope that the accessibility of&amp;nbsp;&lt;/span&gt;&lt;/span&gt;&lt;a href="http://mms.dsfarm.unipd.it/pepMMsMIMIC/"&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;span class="Apple-style-span" style="color: white;"&gt;pepMMsMIMIC&lt;/span&gt;&lt;/span&gt;&lt;/a&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;span class="Apple-style-span" style="color: white;"&gt;&amp;nbsp;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;span style="font-size: small;"&gt;&lt;span lang="en-GB"&gt;&lt;span style="font-weight: normal;"&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;span class="Apple-style-span" style="color: white;"&gt;will encourage medicinal chemists to de-peptidize protein-protein recognition processes of biological interest, thus increasing the potential of &lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;span style="font-size: small;"&gt;&lt;span lang="en-GB"&gt;&lt;i&gt;&lt;span style="font-weight: normal;"&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;span class="Apple-style-span" style="color: white;"&gt;in silico&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/i&gt;&lt;/span&gt;&lt;/span&gt;&lt;span style="font-size: small;"&gt;&lt;span lang="en-GB"&gt;&lt;span style="font-weight: normal;"&gt;&lt;span class="Apple-style-span" style="font-family: inherit;"&gt;&lt;span class="Apple-style-span" style="color: white;"&gt; peptidomimetic compound screening of known small molecules to expedite drug development.&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;/blockquote&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8942442068905528713-5576626019684576898?l=alchemoinformatics.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://alchemoinformatics.blogspot.com/feeds/5576626019684576898/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://alchemoinformatics.blogspot.com/2011/04/novel-web-oriented-peptidomimetic.html#comment-form' title='6 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8942442068905528713/posts/default/5576626019684576898'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8942442068905528713/posts/default/5576626019684576898'/><link rel='alternate' type='text/html' href='http://alchemoinformatics.blogspot.com/2011/04/novel-web-oriented-peptidomimetic.html' title='A novel web-oriented peptidomimetic compound virtual screening tool.'/><author><name>Matteo Floris</name><uri>http://www.blogger.com/profile/13177555699430611910</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='22' height='32' src='http://3.bp.blogspot.com/_eK36skX7Mfk/TBDSbdJfQ4I/AAAAAAAAACw/7kCnO6TT658/S220/matteo.jpg'/></author><thr:total>6</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8942442068905528713.post-7105927999317611898</id><published>2011-04-19T12:12:00.002+02:00</published><updated>2011-04-19T12:18:33.990+02:00</updated><title type='text'>MAISTAS: a tool for automatic structural evaluation of alternative splicing products</title><content type='html'>&lt;div style="color: #47382c; font: 14.0px Georgia; line-height: 18.0px; margin: 0.0px 0.0px 10.0px 0.0px;"&gt;&lt;div style="text-align: justify;"&gt;&lt;span class="Apple-style-span" style="color: #eeeeee;"&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;div style="font: 20.0px Georgia; line-height: 26.0px; margin: 0.0px 0.0px 0.0px 0.0px;"&gt;&lt;div style="text-align: justify;"&gt;&lt;span class="Apple-style-span" style="font-family: Arial, Helvetica, sans-serif;"&gt;&lt;span class="Apple-style-span" style="font-size: small;"&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="color: #eeeeee;"&gt;MAISTAS: a tool for automatic structural evaluation of alternative splicing products&lt;/span&gt;&lt;/b&gt;&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;/div&gt;&lt;div style="font: 20.0px Georgia; line-height: 26.0px; margin: 0.0px 0.0px 0.0px 0.0px;"&gt;&lt;div style="text-align: justify;"&gt;&lt;span class="Apple-style-span" style="font-family: Arial, Helvetica, sans-serif;"&gt;&lt;span class="Apple-style-span" style="font-size: small;"&gt;&lt;span class="Apple-style-span" style="color: #eeeeee;"&gt;&lt;br /&gt;&lt;/span&gt; &lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;/div&gt;&lt;div style="font: 20.0px Georgia; line-height: 26.0px; margin: 0.0px 0.0px 0.0px 0.0px;"&gt;&lt;div style="text-align: justify;"&gt;&lt;span class="Apple-style-span" style="font-family: Arial, Helvetica, sans-serif;"&gt;&lt;span class="Apple-style-span" style="font-size: small;"&gt;&lt;span class="Apple-style-span" style="color: #eeeeee;"&gt;Matteo Floris 1, Domenico Raimondo 2, Guido Leoni 2, Massimiliano Orsini 1, Paolo Marcatili 2 and Anna Tramontano 3,4&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;span class="Apple-style-span" style="font-family: Arial, Helvetica, sans-serif;"&gt;&lt;span class="Apple-style-span" style="font-size: small;"&gt;&lt;span class="Apple-style-span" style="color: #eeeeee;"&gt;*&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;/div&gt;&lt;div style="font: 20.0px Georgia; line-height: 26.0px; margin: 0.0px 0.0px 0.0px 0.0px;"&gt;&lt;div style="text-align: justify;"&gt;&lt;span class="Apple-style-span" style="font-family: Arial, Helvetica, sans-serif;"&gt;&lt;span class="Apple-style-span" style="font-size: small;"&gt;&lt;span class="Apple-style-span" style="color: #eeeeee;"&gt;&lt;br /&gt;&lt;/span&gt; &lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;/div&gt;&lt;div style="font: 20.0px Georgia; line-height: 26.0px; margin: 0.0px 0.0px 0.0px 0.0px;"&gt;&lt;div style="text-align: justify;"&gt;&lt;span class="Apple-style-span" style="font-family: Arial, Helvetica, sans-serif;"&gt;&lt;span class="Apple-style-span" style="font-size: x-small;"&gt;&lt;span class="Apple-style-span" style="color: #eeeeee;"&gt;Author Affiliations&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;/div&gt;&lt;div style="font: 20.0px Georgia; line-height: 26.0px; margin: 0.0px 0.0px 0.0px 0.0px;"&gt;&lt;div style="text-align: justify;"&gt;&lt;span class="Apple-style-span" style="font-family: Arial, Helvetica, sans-serif;"&gt;&lt;span class="Apple-style-span" style="color: #eeeeee;"&gt;&lt;span class="Apple-style-span" style="font-size: x-small;"&gt;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;span class="Apple-style-span" style="font-family: Arial, Helvetica, sans-serif;"&gt;&lt;span class="Apple-style-span" style="font-size: x-small;"&gt;&lt;span class="Apple-style-span" style="color: #eeeeee;"&gt;1 CRS4-Bioinformatics Laboratory, c/o Sardegna Ricerche Scientific Park, Pula, 09010 Cagliari, Italy&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;/div&gt;&lt;div style="font: 20.0px Georgia; line-height: 26.0px; margin: 0.0px 0.0px 0.0px 0.0px;"&gt;&lt;div style="text-align: justify;"&gt;&lt;span class="Apple-style-span" style="font-family: Arial, Helvetica, sans-serif;"&gt;&lt;span class="Apple-style-span" style="color: #eeeeee;"&gt;&lt;span class="Apple-style-span" style="font-size: x-small;"&gt;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;span class="Apple-style-span" style="font-family: Arial, Helvetica, sans-serif;"&gt;&lt;span class="Apple-style-span" style="font-size: x-small;"&gt;&lt;span class="Apple-style-span" style="color: #eeeeee;"&gt;2 Department of Biochemical Sciences, Sapienza University of Rome, P.le A. Moro, 5 - 00185 Rome, Italy&amp;nbsp;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;/div&gt;&lt;div style="font: 20.0px Georgia; line-height: 26.0px; margin: 0.0px 0.0px 0.0px 0.0px;"&gt;&lt;div style="text-align: justify;"&gt;&lt;span class="Apple-style-span" style="font-family: Arial, Helvetica, sans-serif;"&gt;&lt;span class="Apple-style-span" style="color: #eeeeee;"&gt;&lt;span class="Apple-style-span" style="font-size: x-small;"&gt;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;span class="Apple-style-span" style="font-family: Arial, Helvetica, sans-serif;"&gt;&lt;span class="Apple-style-span" style="font-size: x-small;"&gt;&lt;span class="Apple-style-span" style="color: #eeeeee;"&gt;3&amp;nbsp;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;span class="Apple-style-span" style="font-family: Arial, Helvetica, sans-serif;"&gt;&lt;span class="Apple-style-span" style="font-size: x-small;"&gt;&lt;span class="Apple-style-span" style="color: #eeeeee;"&gt;Department of Physics, Sapienza University of Rome, P.le A. Moro, 5 - 00185 Rome, Italy.&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;/div&gt;&lt;div style="font: 20.0px Georgia; line-height: 26.0px; margin: 0.0px 0.0px 0.0px 0.0px;"&gt;&lt;div style="text-align: justify;"&gt;&lt;span class="Apple-style-span" style="font-family: Arial, Helvetica, sans-serif;"&gt;&lt;span class="Apple-style-span" style="color: #eeeeee;"&gt;&lt;span class="Apple-style-span" style="font-size: x-small;"&gt;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;span class="Apple-style-span" style="font-family: Arial, Helvetica, sans-serif;"&gt;&lt;span class="Apple-style-span" style="font-size: x-small;"&gt;&lt;span class="Apple-style-span" style="color: #eeeeee;"&gt;4 Istituto Pasteur Fondazione Cenci Bolognetti, Sapienza University of Rome, P.le A. Moro, 5 - 00185 Rome, Italy.&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;/div&gt;&lt;div style="font: 20.0px Georgia; line-height: 26.0px; margin: 0.0px 0.0px 0.0px 0.0px;"&gt;&lt;div style="text-align: justify;"&gt;&lt;span class="Apple-style-span" style="font-family: Arial, Helvetica, sans-serif;"&gt;&lt;span class="Apple-style-span" style="color: #eeeeee;"&gt;&lt;span class="Apple-style-span" style="font-size: x-small;"&gt;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;span class="Apple-style-span" style="font-family: Arial, Helvetica, sans-serif;"&gt;&lt;span class="Apple-style-span" style="font-size: x-small;"&gt;&lt;span class="Apple-style-span" style="color: #eeeeee;"&gt;*To whom correspondence should be addressed. Prof. Anna Tramontano, E-mail: anna.tramontano@uniroma1.it&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;/div&gt;&lt;div style="font: 20.0px Georgia; line-height: 26.0px; margin: 0.0px 0.0px 0.0px 0.0px;"&gt;&lt;div style="text-align: justify;"&gt;&lt;span class="Apple-style-span" style="font-family: Arial, Helvetica, sans-serif;"&gt;&lt;span class="Apple-style-span" style="font-size: x-small;"&gt;&lt;span class="Apple-style-span" style="color: #eeeeee;"&gt;&lt;br /&gt;&lt;/span&gt; &lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;/div&gt;&lt;div style="font: 20.0px Georgia; line-height: 26.0px; margin: 0.0px 0.0px 0.0px 0.0px;"&gt;&lt;div style="text-align: justify;"&gt;&lt;span class="Apple-style-span" style="font-family: Arial, Helvetica, sans-serif;"&gt;&lt;span class="Apple-style-span" style="color: #eeeeee;"&gt;&lt;span class="Apple-style-span" style="font-size: x-small;"&gt;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;span class="Apple-style-span" style="font-family: Arial, Helvetica, sans-serif;"&gt;&lt;span class="Apple-style-span" style="font-size: x-small;"&gt;&lt;span class="Apple-style-span" style="color: #eeeeee;"&gt;Received October 26, 2010&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;/div&gt;&lt;div style="font: 20.0px Georgia; line-height: 26.0px; margin: 0.0px 0.0px 0.0px 0.0px;"&gt;&lt;div style="text-align: justify;"&gt;&lt;span class="Apple-style-span" style="font-family: Arial, Helvetica, sans-serif;"&gt;&lt;span class="Apple-style-span" style="color: #eeeeee;"&gt;&lt;span class="Apple-style-span" style="font-size: x-small;"&gt;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;span class="Apple-style-span" style="font-family: Arial, Helvetica, sans-serif;"&gt;&lt;span class="Apple-style-span" style="font-size: x-small;"&gt;&lt;span class="Apple-style-span" style="color: #eeeeee;"&gt;Revision received March 17, 2011&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;/div&gt;&lt;div style="font: 20.0px Georgia; line-height: 26.0px; margin: 0.0px 0.0px 0.0px 0.0px;"&gt;&lt;div style="text-align: justify;"&gt;&lt;span class="Apple-style-span" style="font-family: Arial, Helvetica, sans-serif;"&gt;&lt;span class="Apple-style-span" style="color: #eeeeee;"&gt;&lt;span class="Apple-style-span" style="font-size: x-small;"&gt;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;span class="Apple-style-span" style="font-family: Arial, Helvetica, sans-serif;"&gt;&lt;span class="Apple-style-span" style="font-size: x-small;"&gt;&lt;span class="Apple-style-span" style="color: #eeeeee;"&gt;Accepted March 22, 2011&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;/div&gt;&lt;div style="font: 20.0px Georgia; line-height: 26.0px; margin: 0.0px 0.0px 0.0px 0.0px;"&gt;&lt;div style="text-align: justify;"&gt;&lt;span class="Apple-style-span" style="font-family: Arial, Helvetica, sans-serif;"&gt;&lt;span class="Apple-style-span" style="font-size: x-small;"&gt;&lt;span class="Apple-style-span" style="color: #eeeeee;"&gt;Bioinformatics (2011) doi: 10.1093/bioinformatics/btr198 First published online: April 15, 2011&amp;nbsp;&amp;nbsp;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;/div&gt;&lt;div style="font: 20.0px Georgia; line-height: 26.0px; margin: 0.0px 0.0px 0.0px 0.0px;"&gt;&lt;div style="text-align: justify;"&gt;&lt;span class="Apple-style-span" style="font-family: Arial, Helvetica, sans-serif;"&gt;&lt;span class="Apple-style-span" style="font-size: small;"&gt;&lt;span class="Apple-style-span" style="color: #eeeeee;"&gt;&lt;br /&gt;&lt;/span&gt; &lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;/div&gt;&lt;div style="font: 20.0px Georgia; line-height: 26.0px; margin: 0.0px 0.0px 0.0px 0.0px;"&gt;&lt;div style="text-align: justify;"&gt;&lt;span class="Apple-style-span" style="font-family: Arial, Helvetica, sans-serif;"&gt;&lt;span class="Apple-style-span" style="font-size: small;"&gt;&lt;b&gt;&lt;span class="Apple-style-span" style="color: #eeeeee;"&gt;Abstract&lt;/span&gt;&lt;/b&gt;&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;/div&gt;&lt;div style="font: 20.0px Georgia; line-height: 26.0px; margin: 0.0px 0.0px 0.0px 0.0px;"&gt;&lt;div style="text-align: justify;"&gt;&lt;span class="Apple-style-span" style="font-family: Arial, Helvetica, sans-serif;"&gt;&lt;span class="Apple-style-span" style="font-size: small;"&gt;&lt;span class="Apple-style-span" style="color: #eeeeee;"&gt;&lt;br /&gt;&lt;/span&gt; &lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;/div&gt;&lt;div style="font: 20.0px Georgia; line-height: 26.0px; margin: 0.0px 0.0px 0.0px 0.0px;"&gt;&lt;div style="text-align: justify;"&gt;&lt;span class="Apple-style-span" style="font-family: Arial, Helvetica, sans-serif;"&gt;&lt;span class="Apple-style-span" style="font-size: small;"&gt;&lt;span class="Apple-style-span" style="color: #eeeeee;"&gt;Motivation: Analysis of the human genome revealed that the amount of transcribed sequence is an order of magnitude greater than the number of predicted and well characterized genes. A sizeable fraction of these transcripts is related to alternatively spliced forms of known protein coding genes. Inspection of the alternatively spliced transcripts identified in the pilot phase of the ENCODE project has clearly shown that often their structure might substantially differ from that of other isoforms of the same gene, and therefore that they might perform unrelated functions, or that they might even not correspond to a functional protein. Identifying these cases is obviously relevant for the functional assignment of gene products and for the interpretation of the effect of variations in the corresponding proteins.&amp;nbsp;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;/div&gt;&lt;div style="font: 20.0px Georgia; line-height: 26.0px; margin: 0.0px 0.0px 0.0px 0.0px;"&gt;&lt;div style="text-align: justify;"&gt;&lt;span class="Apple-style-span" style="font-family: Arial, Helvetica, sans-serif;"&gt;&lt;span class="Apple-style-span" style="font-size: small;"&gt;&lt;span class="Apple-style-span" style="color: #eeeeee;"&gt;&lt;br /&gt;&lt;/span&gt; &lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;/div&gt;&lt;div style="font: 20.0px Georgia; line-height: 26.0px; margin: 0.0px 0.0px 0.0px 0.0px;"&gt;&lt;div style="text-align: justify;"&gt;&lt;span class="Apple-style-span" style="font-family: Arial, Helvetica, sans-serif;"&gt;&lt;span class="Apple-style-span" style="font-size: small;"&gt;&lt;span class="Apple-style-span" style="color: #eeeeee;"&gt;Results: Here we describe a publicly available tool that, given a gene or a protein, retrieves and analyses all its annotated isoforms, provides users with three-dimensional models of the isoform(s) of his/her interest whenever possible and automatically assesses whether homology derived structural models correspond to plausible structures. This information is clearly relevant. When the homology model of some isoforms of a gene does not seem structurally plausible, the implications are that either they assume a structure unrelated to that of the other isoforms of the same gene with presumably significant functional differences, or do not correspond to functional products. We provide indications that the second hypothesis is likely to be true for a substantial fraction of the cases.&amp;nbsp;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;/div&gt;&lt;div style="font: 20.0px Georgia; line-height: 26.0px; margin: 0.0px 0.0px 0.0px 0.0px;"&gt;&lt;div style="text-align: justify;"&gt;&lt;span class="Apple-style-span" style="font-family: Arial, Helvetica, sans-serif;"&gt;&lt;span class="Apple-style-span" style="font-size: small;"&gt;&lt;span class="Apple-style-span" style="color: #eeeeee;"&gt;&lt;br /&gt;&lt;/span&gt; &lt;/span&gt;&lt;/span&gt;&lt;/div&gt;&lt;/div&gt;&lt;div style="font: 20.0px Georgia; line-height: 26.0px; margin: 0.0px 0.0px 0.0px 0.0px;"&gt;&lt;div style="text-align: justify;"&gt;&lt;span class="Apple-style-span" style="font-family: Arial, Helvetica, sans-serif;"&gt;&lt;span class="Apple-style-span" style="font-size: small;"&gt;&lt;span class="Apple-style-span" style="color: #eeeeee;"&gt;Availability:&amp;nbsp;&lt;/span&gt;&lt;/span&gt;&lt;/span&gt;&lt;span class="Apple-style-span" style="font-family: Arial, Helvetica, sans-serif; font-size: small;"&gt;&lt;a href="http://maistas.bioinformatica.crs4.it/"&gt;&lt;span class="Apple-style-span" style="color: #eeeeee;"&gt;http://maistas.bioinformatica.crs4.it/&lt;/span&gt;&lt;/a&gt;&lt;/span&gt;&lt;/div&gt;&lt;/div&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8942442068905528713-7105927999317611898?l=alchemoinformatics.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='related' href='http://bioinformatics.oxfordjournals.org/content/early/2011/04/15/bioinformatics.btr198.abstract' title='MAISTAS: a tool for automatic structural evaluation of alternative splicing products'/><link rel='replies' type='application/atom+xml' href='http://alchemoinformatics.blogspot.com/feeds/7105927999317611898/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://alchemoinformatics.blogspot.com/2011/04/maistas-tool-for-automatic-structural.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8942442068905528713/posts/default/7105927999317611898'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8942442068905528713/posts/default/7105927999317611898'/><link rel='alternate' type='text/html' href='http://alchemoinformatics.blogspot.com/2011/04/maistas-tool-for-automatic-structural.html' title='MAISTAS: a tool for automatic structural evaluation of alternative splicing products'/><author><name>Matteo Floris</name><uri>http://www.blogger.com/profile/13177555699430611910</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='22' height='32' src='http://3.bp.blogspot.com/_eK36skX7Mfk/TBDSbdJfQ4I/AAAAAAAAACw/7kCnO6TT658/S220/matteo.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8942442068905528713.post-6366834908466680526</id><published>2011-04-13T09:10:00.000+02:00</published><updated>2011-04-13T09:10:28.918+02:00</updated><title type='text'>Splicing isoforms modeling, peptidomimetics and molecular dynamic made easy</title><content type='html'>This new season is started with 3 new accepted papers. Here a brief introduction, I will give more details very soon for each of them:&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;ol&gt;&lt;li&gt;Maìstas (Bioinformatics, &lt;i&gt;first name&lt;/i&gt;), a fully automatic pipeline aimed at building and assessing three-dimensional models for alternative splicing isoforms. The server builds, when possible, comparative structural models for all the splicing isoforms of a submitted gene or set of genes. The models are then analysed in terms of their suitability to exist in the monomeric state, i.e. when a warning appears in the model assessment, it cannot be excluded the possibility that other multimeric state may stabilize the structure.&amp;nbsp;Moreover, the splicing isoform exonic coordinates are mapped on the final models.&lt;/li&gt;&lt;li&gt;pep:MMs:MIMIC&amp;nbsp;(Nucleic Acid Research, Web Server Issue, &lt;i&gt;first name&lt;/i&gt;), a web-oriented tool that, given a peptide three-dimensional structure, is able to automate a multiconformers three-dimensional similarity search among 17 million of conformers calculated from 3.9 million of commercially available chemicals collected in the MMsINC database.&lt;/li&gt;&lt;li&gt;ClickMD&amp;nbsp;(Future Medicinal Chemistry), a web-based explicit solvent molecular dynamic simulator.&amp;nbsp;ClickMD performs minimization, equilibration phase and a short run of classical MD.&amp;nbsp;ClickMD works with PDB files of protein and peptides.&amp;nbsp;You just needs a valid PDB file to start the MD simulation!&amp;nbsp;You will receive an e-mail at the end of the simulation containing a link to a web page where you can download the MD results as: log files, trajectory files, energy and RMSD representations and graphs.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;/li&gt;&lt;/ol&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8942442068905528713-6366834908466680526?l=alchemoinformatics.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://alchemoinformatics.blogspot.com/feeds/6366834908466680526/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://alchemoinformatics.blogspot.com/2011/04/splicing-isoforms-modeling.html#comment-form' title='2 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8942442068905528713/posts/default/6366834908466680526'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8942442068905528713/posts/default/6366834908466680526'/><link rel='alternate' type='text/html' href='http://alchemoinformatics.blogspot.com/2011/04/splicing-isoforms-modeling.html' title='Splicing isoforms modeling, peptidomimetics and molecular dynamic made easy'/><author><name>Matteo Floris</name><uri>http://www.blogger.com/profile/13177555699430611910</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='22' height='32' src='http://3.bp.blogspot.com/_eK36skX7Mfk/TBDSbdJfQ4I/AAAAAAAAACw/7kCnO6TT658/S220/matteo.jpg'/></author><thr:total>2</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8942442068905528713.post-1233993173727550213</id><published>2011-01-04T16:17:00.000+01:00</published><updated>2011-01-04T16:17:26.967+01:00</updated><title type='text'>Job position: modeling the interaction of genetic and environmental factors in autoimmune diseases</title><content type='html'>Not exactly drug design (not yet): a grant is available for modeling the interaction of genetic + environmental factors in autoimmune diseases. Deadline for application is Jan 11, 12AM Italy timezone.&lt;br /&gt;&lt;br /&gt;It will be cooordinated by CRS4 in collaboration with two clinical units for MS and DT1 in Cagliari Hospitals and Biomed Dept for Chron disease in Sassari. Info in Italian at &lt;a href="http://www.unica.it/UserFiles/File/Selezioni/SciCardio%20N.%206.doc"&gt;http://www.unica.it/UserFiles/File/Selezioni/SciCardio%20N.%206.doc&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;Please send directly an email to Enrico Pieroni if interested (&lt;a href="mailto:ep@crs4.it"&gt;ep@crs4.it&lt;/a&gt;).&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8942442068905528713-1233993173727550213?l=alchemoinformatics.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://alchemoinformatics.blogspot.com/feeds/1233993173727550213/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://alchemoinformatics.blogspot.com/2011/01/job-position-modeling-interaction-of.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8942442068905528713/posts/default/1233993173727550213'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8942442068905528713/posts/default/1233993173727550213'/><link rel='alternate' type='text/html' href='http://alchemoinformatics.blogspot.com/2011/01/job-position-modeling-interaction-of.html' title='Job position: modeling the interaction of genetic and environmental factors in autoimmune diseases'/><author><name>Matteo Floris</name><uri>http://www.blogger.com/profile/13177555699430611910</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='22' height='32' src='http://3.bp.blogspot.com/_eK36skX7Mfk/TBDSbdJfQ4I/AAAAAAAAACw/7kCnO6TT658/S220/matteo.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8942442068905528713.post-4158507547524938341</id><published>2010-12-23T19:05:00.003+01:00</published><updated>2010-12-23T19:19:05.948+01:00</updated><title type='text'>How to download MMsINC entries for Autodock.</title><content type='html'>&lt;a href="http://autodock.scripps.edu/"&gt;AutoDock&lt;/a&gt; is a &lt;blockquote&gt;suite of automated docking tools. It is designed to predict how small molecules, such as substrates or drug candidates, bind to a receptor of known 3D structure&lt;/blockquote&gt;&lt;br /&gt;&lt;br /&gt;AutoDock 4 uses PDBQT formatted files not only for the receptor but also for the ligand.&lt;br /&gt;The PDBQT format is described &lt;a href="http://autodock.scripps.edu/faqs-help/faq/what-is-the-format-of-a-pdbqt-file"&gt;here&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;Yesterday I have converted all the MMsINC entries (about 4M) to the PDBQT format.&lt;br /&gt;It is so possible to download each entry as input format for Autodock experiments.&lt;br /&gt;&lt;br /&gt;Have a look at this &lt;a href="http://mms.dsfarm.unipd.it/MMsINC/search/molecule.php?mmscode=MMs02218121"&gt;example page&lt;/a&gt;. As you can see, at the top of the page there is a menu for the download of several format files. The first of them is Autodock input file. Simply press the "go" button to download it.&lt;br /&gt;&lt;br /&gt;Let me close this post with this nice photo.&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://1.bp.blogspot.com/_eK36skX7Mfk/TROSBvWRv4I/AAAAAAAAADo/948Ir_2gQLI/s1600/foto_ricci_poetto.JPG"&gt;&lt;img style="display:block; margin:0px auto 10px; text-align:center;cursor:pointer; cursor:hand;width: 320px; height: 240px;" src="http://1.bp.blogspot.com/_eK36skX7Mfk/TROSBvWRv4I/AAAAAAAAADo/948Ir_2gQLI/s320/foto_ricci_poetto.JPG" border="0" alt="" id="BLOGGER_PHOTO_ID_5553943324065382274" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;Where: the place is the Poetto Beach in Cagliari (Sardinia, Italy).&lt;br /&gt;What: Stefano Moro and me eating these "ricci di mare" (&lt;a href="http://en.wikipedia.org/wiki/Sea_urchin"&gt;Sea urchin&lt;/a&gt;).&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8942442068905528713-4158507547524938341?l=alchemoinformatics.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://alchemoinformatics.blogspot.com/feeds/4158507547524938341/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://alchemoinformatics.blogspot.com/2010/12/how-to-download-mmsinc-entries-for.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8942442068905528713/posts/default/4158507547524938341'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8942442068905528713/posts/default/4158507547524938341'/><link rel='alternate' type='text/html' href='http://alchemoinformatics.blogspot.com/2010/12/how-to-download-mmsinc-entries-for.html' title='How to download MMsINC entries for Autodock.'/><author><name>Matteo Floris</name><uri>http://www.blogger.com/profile/13177555699430611910</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='22' height='32' src='http://3.bp.blogspot.com/_eK36skX7Mfk/TBDSbdJfQ4I/AAAAAAAAACw/7kCnO6TT658/S220/matteo.jpg'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://1.bp.blogspot.com/_eK36skX7Mfk/TROSBvWRv4I/AAAAAAAAADo/948Ir_2gQLI/s72-c/foto_ricci_poetto.JPG' height='72' width='72'/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8942442068905528713.post-2678472398995497960</id><published>2010-12-16T10:31:00.002+01:00</published><updated>2010-12-16T10:56:55.390+01:00</updated><title type='text'>The hard life of a chemoinformatician (and the chemical space in a bag)</title><content type='html'>Part 1: &lt;span style="font-style:italic;"&gt;The hard life of a chemoinformatician&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;Thoughts at the end of the Year 2010.&lt;br /&gt;PhD going to be completed in the next 2 months.&lt;br /&gt;My contract is coming to the end (February).&lt;br /&gt;I would like to do more cheminfo than bioinfo.&lt;br /&gt;But this is not easy, also because there are more opportunities in bioinfo than in cheminfo.&lt;br /&gt;&lt;br /&gt;Part 2: &lt;span style="font-style:italic;"&gt;the chemical space in a bag&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;Stefano Moro will be wellcome to my village during next weekend.&lt;br /&gt;In his bag, there will be the whole public chemical space (the future MMsINC 2.0).&lt;br /&gt;I will enjoy exploring that space during the holidays.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8942442068905528713-2678472398995497960?l=alchemoinformatics.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://alchemoinformatics.blogspot.com/feeds/2678472398995497960/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://alchemoinformatics.blogspot.com/2010/12/hard-life-of-chemoinformatician-and.html#comment-form' title='3 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8942442068905528713/posts/default/2678472398995497960'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8942442068905528713/posts/default/2678472398995497960'/><link rel='alternate' type='text/html' href='http://alchemoinformatics.blogspot.com/2010/12/hard-life-of-chemoinformatician-and.html' title='The hard life of a chemoinformatician (and the chemical space in a bag)'/><author><name>Matteo Floris</name><uri>http://www.blogger.com/profile/13177555699430611910</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='22' height='32' src='http://3.bp.blogspot.com/_eK36skX7Mfk/TBDSbdJfQ4I/AAAAAAAAACw/7kCnO6TT658/S220/matteo.jpg'/></author><thr:total>3</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8942442068905528713.post-6884186530102789999</id><published>2010-11-08T12:53:00.004+01:00</published><updated>2010-11-08T13:03:13.403+01:00</updated><title type='text'>Who is using MMsINC?</title><content type='html'>A good number of people are using &lt;a href="http://mms.dsfarm.unipd.it/MMsINC/search/"&gt;MMsINC&lt;/a&gt; (summary from May to October 2010), &lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://4.bp.blogspot.com/_eK36skX7Mfk/TNfmgQaR-_I/AAAAAAAAADY/bqmN4VrTf6I/s1600/totali.gif"&gt;&lt;img style="display:block; margin:0px auto 10px; text-align:center;cursor:pointer; cursor:hand;width: 320px; height: 70px;" src="http://4.bp.blogspot.com/_eK36skX7Mfk/TNfmgQaR-_I/AAAAAAAAADY/bqmN4VrTf6I/s320/totali.gif" border="0" alt=""id="BLOGGER_PHOTO_ID_5537147708710714354" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;but what is interesting is the geographical distribution of the visitors: of the last 500 visitors, 67% were from India.&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://3.bp.blogspot.com/_eK36skX7Mfk/TNfmvB7l-_I/AAAAAAAAADg/sCBW7ShzQKA/s1600/visite_per_paese.gif"&gt;&lt;img style="display:block; margin:0px auto 10px; text-align:center;cursor:pointer; cursor:hand;width: 320px; height: 233px;" src="http://3.bp.blogspot.com/_eK36skX7Mfk/TNfmvB7l-_I/AAAAAAAAADg/sCBW7ShzQKA/s320/visite_per_paese.gif" border="0" alt=""id="BLOGGER_PHOTO_ID_5537147962521943026" /&gt;&lt;/a&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8942442068905528713-6884186530102789999?l=alchemoinformatics.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://alchemoinformatics.blogspot.com/feeds/6884186530102789999/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://alchemoinformatics.blogspot.com/2010/11/who-is-using-mmsinc.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8942442068905528713/posts/default/6884186530102789999'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8942442068905528713/posts/default/6884186530102789999'/><link rel='alternate' type='text/html' href='http://alchemoinformatics.blogspot.com/2010/11/who-is-using-mmsinc.html' title='Who is using MMsINC?'/><author><name>Matteo Floris</name><uri>http://www.blogger.com/profile/13177555699430611910</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='22' height='32' src='http://3.bp.blogspot.com/_eK36skX7Mfk/TBDSbdJfQ4I/AAAAAAAAACw/7kCnO6TT658/S220/matteo.jpg'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://4.bp.blogspot.com/_eK36skX7Mfk/TNfmgQaR-_I/AAAAAAAAADY/bqmN4VrTf6I/s72-c/totali.gif' height='72' width='72'/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8942442068905528713.post-7312863340812832476</id><published>2010-09-17T14:30:00.002+02:00</published><updated>2010-09-17T14:40:22.999+02:00</updated><title type='text'>Pharao installation</title><content type='html'>&lt;a href="http://www.silicos.be/news.html"&gt;Few days ago&lt;/a&gt; Silicos released the source code of the tool &lt;a href="http://www.ncbi.nlm.nih.gov/pubmed/18485770"&gt;Pharao&lt;/a&gt;.&lt;br /&gt;Here some notes about the installation of Pharao.&lt;br /&gt;I had some problems, and with the help of the Pharao developers (Gert Thijs) I understood how to install it properly.&lt;br /&gt;&lt;br /&gt;1) install &lt;a href="http://www.cmake.org/cmake/resources/software.html"&gt;cmake&lt;/a&gt;&lt;br /&gt;2) install the &lt;a href="http://openbabel.org/wiki/Subversion"&gt;latest openbabel release via svn&lt;/a&gt;&lt;br /&gt;3) follow the installation of openbabel &lt;a href="http://openbabel.org/wiki/CMake"&gt;here&lt;/a&gt;&lt;br /&gt;4) download and install &lt;a href="http://www.silicos.be/download.html"&gt;pharao&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;If everything worked fine, there should not be any problem.&lt;br /&gt;&lt;br /&gt;See also &lt;a href="http://blueobelisk.shapado.com/questions/pharao-installation-anyone-else-installed-it"&gt;BO exchange&lt;/a&gt;.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8942442068905528713-7312863340812832476?l=alchemoinformatics.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://alchemoinformatics.blogspot.com/feeds/7312863340812832476/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://alchemoinformatics.blogspot.com/2010/09/pharao-installation.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8942442068905528713/posts/default/7312863340812832476'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8942442068905528713/posts/default/7312863340812832476'/><link rel='alternate' type='text/html' href='http://alchemoinformatics.blogspot.com/2010/09/pharao-installation.html' title='Pharao installation'/><author><name>Matteo Floris</name><uri>http://www.blogger.com/profile/13177555699430611910</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='22' height='32' src='http://3.bp.blogspot.com/_eK36skX7Mfk/TBDSbdJfQ4I/AAAAAAAAACw/7kCnO6TT658/S220/matteo.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8942442068905528713.post-2765006097988019501</id><published>2010-09-12T12:14:00.008+02:00</published><updated>2010-09-12T14:46:55.709+02:00</updated><title type='text'>Automated large-scale protein modeling</title><content type='html'>A pipeline for the multiple automated comparative modeling can be &lt;span style="font-style:italic;"&gt;easily&lt;/span&gt; built with the following software:&lt;br /&gt;&lt;br /&gt;1) the best template (based on some filters such as the e-value and the resolution) is identified using the &lt;a href="http://toolkit.tuebingen.mpg.de/hhpred"&gt;hhsearch&lt;/a&gt; program; &lt;br /&gt;2) &lt;a href="http://www.salilab.org/modeller/"&gt;Modeller&lt;/a&gt; is used for the comparative modeling;&lt;br /&gt;3) databases for the template search (nr90 and nr70) and for the modeling (a reformatted version of the &lt;a href="http://www.rcsb.org/pdb/home/home.do"&gt;PDB&lt;/a&gt;) are available from the hhsearch ftp site;&lt;br /&gt;4) a set of python and bash utilities for the management of the jobs on a computer cluster.&lt;br /&gt;&lt;br /&gt;All these building block are part of my pipeline that is going to be released.&lt;br /&gt;I'll give more informations soon.&lt;br /&gt;&lt;br /&gt;Some days ago a test experiment revealed that the pipeline can build 450 models in 9 hours (50 models per hour).&lt;br /&gt;Not so fast, but my pipeline contains also come modules for the model assessment and the cluster resources are shared with a lot of different users.&lt;br /&gt;&lt;br /&gt;With a dedicated (and larger) cluster, I suppose it would be possible to model the whole human proteome (ca. 78.000 peptides, source: &lt;a href="http://www.ensembl.org"&gt;Ensembl&lt;/a&gt;) in 1 or 2 weeks with this pipeline.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8942442068905528713-2765006097988019501?l=alchemoinformatics.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://alchemoinformatics.blogspot.com/feeds/2765006097988019501/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://alchemoinformatics.blogspot.com/2010/09/automated-large-scale-protein-modeling.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8942442068905528713/posts/default/2765006097988019501'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8942442068905528713/posts/default/2765006097988019501'/><link rel='alternate' type='text/html' href='http://alchemoinformatics.blogspot.com/2010/09/automated-large-scale-protein-modeling.html' title='Automated large-scale protein modeling'/><author><name>Matteo Floris</name><uri>http://www.blogger.com/profile/13177555699430611910</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='22' height='32' src='http://3.bp.blogspot.com/_eK36skX7Mfk/TBDSbdJfQ4I/AAAAAAAAACw/7kCnO6TT658/S220/matteo.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8942442068905528713.post-5299563949669652827</id><published>2010-08-23T17:10:00.006+02:00</published><updated>2010-08-23T17:36:18.732+02:00</updated><title type='text'>MMsINC 2.0: coming soon.</title><content type='html'>We are processing (mainly &lt;a href="http://mms.dsfarm.unipd.it/mfanton.htm"&gt;Marco Fanton at Univ. of Padova&lt;/a&gt;) lot of public sources for the next MMsINC release; please feel free to contact me if you have any SDF catalog, it will be our pleasuse to process and incorporate it (with the appropriate link to your Company or Institute).&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;div align="center"&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://www.sardegnadigitallibrary.it/mmt/480/63135.jpg"&gt;&lt;img style="float:center;margin:0 0 10px 10px;cursor:pointer; cursor:hand;width: 336px; height: 226px;" src="http://www.sardegnadigitallibrary.it/mmt/480/63135.jpg" border="0" alt="filigrana sarda" /&gt;&lt;/a&gt;&lt;br&gt;&lt;a href="http://www.sardegnadigitallibrary.it/index.php?xsl=626&amp;id=63135"&gt;Sardinian gold jewel&lt;/a&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8942442068905528713-5299563949669652827?l=alchemoinformatics.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://alchemoinformatics.blogspot.com/feeds/5299563949669652827/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://alchemoinformatics.blogspot.com/2010/08/mmsinc-20-coming-soon.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8942442068905528713/posts/default/5299563949669652827'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8942442068905528713/posts/default/5299563949669652827'/><link rel='alternate' type='text/html' href='http://alchemoinformatics.blogspot.com/2010/08/mmsinc-20-coming-soon.html' title='MMsINC 2.0: coming soon.'/><author><name>Matteo Floris</name><uri>http://www.blogger.com/profile/13177555699430611910</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='22' height='32' src='http://3.bp.blogspot.com/_eK36skX7Mfk/TBDSbdJfQ4I/AAAAAAAAACw/7kCnO6TT658/S220/matteo.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8942442068905528713.post-4102911598485704783</id><published>2010-08-19T14:32:00.003+02:00</published><updated>2010-08-19T14:39:31.928+02:00</updated><title type='text'>"Ultrafast shape recognition" method implementation</title><content type='html'>I have implemented the Ultrafast Shape Recognition method (&lt;a href="http://rspa.royalsocietypublishing.org/content/463/2081/1307.short"&gt;Ballester and Richards, Proc. R. Soc. A., 2007&lt;/a&gt;) for the next MMsINC release... pure Python implementation, no external libraries required, fast calculation. I'm looking for a small dataset for the validation of my script. If you are interested in the source code, please send me an email at matteo.floris@gmail.com.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8942442068905528713-4102911598485704783?l=alchemoinformatics.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://alchemoinformatics.blogspot.com/feeds/4102911598485704783/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://alchemoinformatics.blogspot.com/2010/08/ultrafast-shape-recognition-method.html#comment-form' title='2 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8942442068905528713/posts/default/4102911598485704783'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8942442068905528713/posts/default/4102911598485704783'/><link rel='alternate' type='text/html' href='http://alchemoinformatics.blogspot.com/2010/08/ultrafast-shape-recognition-method.html' title='&quot;Ultrafast shape recognition&quot; method implementation'/><author><name>Matteo Floris</name><uri>http://www.blogger.com/profile/13177555699430611910</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='22' height='32' src='http://3.bp.blogspot.com/_eK36skX7Mfk/TBDSbdJfQ4I/AAAAAAAAACw/7kCnO6TT658/S220/matteo.jpg'/></author><thr:total>2</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8942442068905528713.post-7933970201990615522</id><published>2010-07-01T17:04:00.001+02:00</published><updated>2010-07-01T17:06:13.271+02:00</updated><title type='text'>InChI Version 1, Software Version 1.03 is out!</title><content type='html'>&lt;a href="http://www.iupac.org/inchi/release103.html"&gt;http://www.iupac.org/inchi/release103.html&lt;/a&gt;: &lt;br /&gt;&lt;blockquote&gt;InChI Version 1, Software Version 1.03 &lt;br /&gt;– implemented for both Standard and &lt;br /&gt;Non-standard (Customized) InChI/InChIKey&lt;/blockquote&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8942442068905528713-7933970201990615522?l=alchemoinformatics.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://alchemoinformatics.blogspot.com/feeds/7933970201990615522/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://alchemoinformatics.blogspot.com/2010/07/inchi-version-1-software-version-103-is.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8942442068905528713/posts/default/7933970201990615522'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8942442068905528713/posts/default/7933970201990615522'/><link rel='alternate' type='text/html' href='http://alchemoinformatics.blogspot.com/2010/07/inchi-version-1-software-version-103-is.html' title='InChI Version 1, Software Version 1.03 is out!'/><author><name>Matteo Floris</name><uri>http://www.blogger.com/profile/13177555699430611910</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='22' height='32' src='http://3.bp.blogspot.com/_eK36skX7Mfk/TBDSbdJfQ4I/AAAAAAAAACw/7kCnO6TT658/S220/matteo.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8942442068905528713.post-7302670975672389159</id><published>2010-06-23T08:54:00.003+02:00</published><updated>2010-06-23T09:08:07.942+02:00</updated><title type='text'>Removing duplicates from large SDF files</title><content type='html'>Maybe there are better solutions, but this worked very well with a random set taken from Pubchem (5.000.000 structures, but I introduced random duplicates, for a total of 120.000.000 structures):&lt;br /&gt;1) generate your preferred inchis for all the structures in your big SDF, and update the SDF with these inchis (you can use pybel for that)&lt;br /&gt;2) extract PUBCHEM_COMPOUND_CID from the SDF:&lt;br /&gt;&lt;blockquote&gt;&lt;br /&gt;grep PUBCHEM_COMPOUND_CID -A 1 big.sdf &gt; PUBCHEM_COMPOUND_CID | grep -v "PUBCHEM_COMPOUND_CID" | grep -v "-" &gt; CIDs.txt&lt;br /&gt;&lt;/blockquote&gt;&lt;br /&gt;3) then put inchis and CIDs in the same file:&lt;br /&gt;&lt;blockquote&gt;&lt;br /&gt;paste inchi CIDs.txt &gt; inchi_CIDs.txt&lt;br /&gt;&lt;/blockquote&gt;&lt;br /&gt;4) now you can sort this file:&lt;br /&gt;&lt;blockquote&gt;&lt;br /&gt;sort inchi_CID.txt -o inchi_CID_sort.txt&lt;br /&gt;&lt;/blockquote&gt;&lt;br /&gt;so, all the duplicates are visible...&lt;br /&gt;5) now, you could load all the inchis as keys of a python cPickle dictionary... if an inchi is unique in the inchi_CID_sort.txt file, the value of the key is 0, if it is a duplicate (last visited inchi == actual inchi) then the value of the key is 10.&lt;br /&gt;6) now, the python script should parse the SDF in this way:&lt;br /&gt;for each structure:&lt;br /&gt;if the inchi of this structure has value 0 in the dictionary, save the molecule;&lt;br /&gt;if the value is 10, save the molecule, but change the value to 11;&lt;br /&gt;if the value is 11, skip this structure&lt;br /&gt;&lt;br /&gt;I would suggest to save the output file every 100.000 structures, the open a different output file at each iteration... at the end, a "cat" command will generate a big SDF without duplicates.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8942442068905528713-7302670975672389159?l=alchemoinformatics.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://alchemoinformatics.blogspot.com/feeds/7302670975672389159/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://alchemoinformatics.blogspot.com/2010/06/removing-duplicates-from-large-sdf.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8942442068905528713/posts/default/7302670975672389159'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8942442068905528713/posts/default/7302670975672389159'/><link rel='alternate' type='text/html' href='http://alchemoinformatics.blogspot.com/2010/06/removing-duplicates-from-large-sdf.html' title='Removing duplicates from large SDF files'/><author><name>Matteo Floris</name><uri>http://www.blogger.com/profile/13177555699430611910</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='22' height='32' src='http://3.bp.blogspot.com/_eK36skX7Mfk/TBDSbdJfQ4I/AAAAAAAAACw/7kCnO6TT658/S220/matteo.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8942442068905528713.post-7982761070119879013</id><published>2010-05-13T10:31:00.012+02:00</published><updated>2010-05-13T10:50:51.877+02:00</updated><title type='text'>Which is the "real" RU-486? [2]</title><content type='html'>From a Pubchem search (search term: RU-486, 20 results, 18 RO5):&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;div&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://pubchem.ncbi.nlm.nih.gov/summary/summary.cgi?cid=44327040"&gt;&lt;img style="margin:0px auto 10px; text-align:left;cursor:pointer; cursor:hand;width: 300px; height: 300px;" src="http://pubchem.ncbi.nlm.nih.gov/image/imgsrv.fcgi?t=l&amp;cid=44327040" border="0" alt="" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://pubchem.ncbi.nlm.nih.gov/summary/summary.cgi?cid=44372311"&gt;&lt;img style="margin:0px auto 10px; text-align:left;cursor:pointer; cursor:hand;width: 300px; height: 300px;" src="http://pubchem.ncbi.nlm.nih.gov/image/imgsrv.fcgi?t=l&amp;cid=44372311" border="0" alt="" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://pubchem.ncbi.nlm.nih.gov/summary/summary.cgi?cid=44327059"&gt;&lt;img style="margin:0px auto 10px; text-align:left;cursor:pointer; cursor:hand;width: 300px; height: 300px;" src="http://pubchem.ncbi.nlm.nih.gov/image/imgsrv.fcgi?t=l&amp;cid=44327059" border="0" alt="" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://pubchem.ncbi.nlm.nih.gov/summary/summary.cgi?cid=18649237"&gt;&lt;img style="margin:0px auto 10px; text-align:left;cursor:pointer; cursor:hand;width: 300px; height: 300px;" src="http://pubchem.ncbi.nlm.nih.gov/image/imgsrv.fcgi?t=l&amp;cid=18649237" border="0" alt="" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://pubchem.ncbi.nlm.nih.gov/summary/summary.cgi?cid=55245"&gt;&lt;img style="margin:0px auto 10px; text-align:left;cursor:pointer; cursor:hand;width: 300px; height: 300px;" src="http://pubchem.ncbi.nlm.nih.gov/image/imgsrv.fcgi?t=l&amp;cid=55245" border="0" alt="" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://pubchem.ncbi.nlm.nih.gov/summary/summary.cgi?cid=16758830"&gt;&lt;img style="margin:0px auto 10px; text-align:left;cursor:pointer; cursor:hand;width: 300px; height: 300px;" src="http://pubchem.ncbi.nlm.nih.gov/image/imgsrv.fcgi?t=l&amp;cid=16758830" border="0" alt="" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://pubchem.ncbi.nlm.nih.gov/summary/summary.cgi?cid=11743390"&gt;&lt;img style="margin:0px auto 10px; text-align:left;cursor:pointer; cursor:hand;width: 300px; height: 300px;" src="http://pubchem.ncbi.nlm.nih.gov/image/imgsrv.fcgi?t=l&amp;cid=11743390" border="0" alt="" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://pubchem.ncbi.nlm.nih.gov/summary/summary.cgi?cid=10503032"&gt;&lt;img style="margin:0px auto 10px; text-align:left;cursor:pointer; cursor:hand;width: 300px; height: 300px;" src="http://pubchem.ncbi.nlm.nih.gov/image/imgsrv.fcgi?t=l&amp;cid=10503032" border="0" alt="" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://pubchem.ncbi.nlm.nih.gov/summary/summary.cgi?cid=10387949"&gt;&lt;img style="margin:0px auto 10px; text-align:left;cursor:pointer; cursor:hand;width: 300px; height: 300px;" src="http://pubchem.ncbi.nlm.nih.gov/image/imgsrv.fcgi?t=l&amp;cid=10387949" border="0" alt="" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://pubchem.ncbi.nlm.nih.gov/summary/summary.cgi?cid=9910521"&gt;&lt;img style="margin:0px auto 10px; text-align:left;cursor:pointer; cursor:hand;width: 300px; height: 300px;" src="http://pubchem.ncbi.nlm.nih.gov/image/imgsrv.fcgi?t=l&amp;cid=9910521" border="0" alt="" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://pubchem.ncbi.nlm.nih.gov/summary/summary.cgi?cid=7048710"&gt;&lt;img style="margin:0px auto 10px; text-align:left;cursor:pointer; cursor:hand;width: 300px; height: 300px;" src="http://pubchem.ncbi.nlm.nih.gov/image/imgsrv.fcgi?t=l&amp;cid=7048710" border="0" alt="" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://pubchem.ncbi.nlm.nih.gov/summary/summary.cgi?cid=7048709"&gt;&lt;img style="margin:0px auto 10px; text-align:left;cursor:pointer; cursor:hand;width: 300px; height: 300px;" src="http://pubchem.ncbi.nlm.nih.gov/image/imgsrv.fcgi?t=l&amp;cid=7048709" border="0" alt="" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://pubchem.ncbi.nlm.nih.gov/summary/summary.cgi?cid=7048587"&gt;&lt;img style="margin:0px auto 10px; text-align:left;cursor:pointer; cursor:hand;width: 300px; height: 300px;" src="http://pubchem.ncbi.nlm.nih.gov/image/imgsrv.fcgi?t=l&amp;cid=7048587" border="0" alt="" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://pubchem.ncbi.nlm.nih.gov/summary/summary.cgi?cid=6712024"&gt;&lt;img style="margin:0px auto 10px; text-align:left;cursor:pointer; cursor:hand;width: 300px; height: 300px;" src="http://pubchem.ncbi.nlm.nih.gov/image/imgsrv.fcgi?t=l&amp;cid=6712024" border="0" alt="" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://pubchem.ncbi.nlm.nih.gov/summary/summary.cgi?cid=6426861"&gt;&lt;img style="margin:0px auto 10px; text-align:left;cursor:pointer; cursor:hand;width: 300px; height: 300px;" src="http://pubchem.ncbi.nlm.nih.gov/image/imgsrv.fcgi?t=l&amp;cid=6426861" border="0" alt="" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://pubchem.ncbi.nlm.nih.gov/summary/summary.cgi?cid=6604870"&gt;&lt;img style="margin:0px auto 10px; text-align:left;cursor:pointer; cursor:hand;width: 300px; height: 300px;" src="http://pubchem.ncbi.nlm.nih.gov/image/imgsrv.fcgi?t=l&amp;cid=6604870" border="0" alt="" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://pubchem.ncbi.nlm.nih.gov/summary/summary.cgi?cid=6604445"&gt;&lt;img style="margin:0px auto 10px; text-align:left;cursor:pointer; cursor:hand;width: 300px; height: 300px;" src="http://pubchem.ncbi.nlm.nih.gov/image/imgsrv.fcgi?t=l&amp;cid=6604445" border="0" alt="" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://pubchem.ncbi.nlm.nih.gov/summary/summary.cgi?cid=6569219"&gt;&lt;img style="margin:0px auto 10px; text-align:left;cursor:pointer; cursor:hand;width: 300px; height: 300px;" src="http://pubchem.ncbi.nlm.nih.gov/image/imgsrv.fcgi?t=l&amp;cid=6569219" border="0" alt="" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://pubchem.ncbi.nlm.nih.gov/summary/summary.cgi?cid=1756360"&gt;&lt;img style="margin:0px auto 10px; text-align:left;cursor:pointer; cursor:hand;width: 300px; height: 300px;" src="http://pubchem.ncbi.nlm.nih.gov/image/imgsrv.fcgi?t=l&amp;cid=1756360" border="0" alt="" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://pubchem.ncbi.nlm.nih.gov/summary/summary.cgi?cid=4196"&gt;&lt;img style="margin:0px auto 10px; text-align:left;cursor:pointer; cursor:hand;width: 300px; height: 300px;" src="http://pubchem.ncbi.nlm.nih.gov/image/imgsrv.fcgi?t=l&amp;cid=4196" border="0" alt="" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8942442068905528713-7982761070119879013?l=alchemoinformatics.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://alchemoinformatics.blogspot.com/feeds/7982761070119879013/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://alchemoinformatics.blogspot.com/2010/05/which-is-real-ru-486-2.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8942442068905528713/posts/default/7982761070119879013'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8942442068905528713/posts/default/7982761070119879013'/><link rel='alternate' type='text/html' href='http://alchemoinformatics.blogspot.com/2010/05/which-is-real-ru-486-2.html' title='Which is the &quot;real&quot; RU-486? [2]'/><author><name>Matteo Floris</name><uri>http://www.blogger.com/profile/13177555699430611910</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='22' height='32' src='http://3.bp.blogspot.com/_eK36skX7Mfk/TBDSbdJfQ4I/AAAAAAAAACw/7kCnO6TT658/S220/matteo.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8942442068905528713.post-6438399712050189842</id><published>2010-05-07T18:56:00.006+02:00</published><updated>2010-05-07T19:07:54.658+02:00</updated><title type='text'>Which is the "real" RU-486?</title><content type='html'>This&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://www.ebi.ac.uk/chebi/displayImage.do?defaultImage=true&amp;imageIndex=0&amp;chebiId=419138"&gt;&lt;img style="display:block; margin:0px auto 10px; text-align:center;cursor:pointer; cursor:hand;width: 200px; height: 200px;" src="http://www.ebi.ac.uk/chebi/displayImage.do?defaultImage=true&amp;imageIndex=0&amp;chebiId=419138" border="0" alt="" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;or this&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://www.ebi.ac.uk/chebi/displayImage.do;jsessionid=6DEF27B723C7A71662AE2E0AED3367C4?defaultImage=true&amp;imageIndex=0&amp;chebiId=363012"&gt;&lt;img style="display:block; margin:0px auto 10px; text-align:center;cursor:pointer; cursor:hand;width: 200px; height: 200px;" src="http://www.ebi.ac.uk/chebi/displayImage.do;jsessionid=6DEF27B723C7A71662AE2E0AED3367C4?defaultImage=true&amp;imageIndex=0&amp;chebiId=363012" border="0" alt="" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;???&lt;br /&gt;These images are from ChEBI: &lt;a href="http://www.ebi.ac.uk/chebi/searchId.do?chebiId=363012"&gt;http://www.ebi.ac.uk/chebi/searchId.do?chebiId=363012&lt;/a&gt; and &lt;a href="http://www.ebi.ac.uk/chebi/searchId.do?chebiId=363012"&gt;http://www.ebi.ac.uk/chebi/searchId.do?chebiId=363012&lt;/a&gt;.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8942442068905528713-6438399712050189842?l=alchemoinformatics.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://alchemoinformatics.blogspot.com/feeds/6438399712050189842/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://alchemoinformatics.blogspot.com/2010/05/which-is-real-ru-486.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8942442068905528713/posts/default/6438399712050189842'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8942442068905528713/posts/default/6438399712050189842'/><link rel='alternate' type='text/html' href='http://alchemoinformatics.blogspot.com/2010/05/which-is-real-ru-486.html' title='Which is the &quot;real&quot; RU-486?'/><author><name>Matteo Floris</name><uri>http://www.blogger.com/profile/13177555699430611910</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='22' height='32' src='http://3.bp.blogspot.com/_eK36skX7Mfk/TBDSbdJfQ4I/AAAAAAAAACw/7kCnO6TT658/S220/matteo.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8942442068905528713.post-1301283847025743346</id><published>2010-04-22T14:11:00.001+02:00</published><updated>2010-04-22T14:12:53.072+02:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='chembl'/><title type='text'>ChEMBL_03 is available!</title><content type='html'>&lt;a href="http://www.ebi.ac.uk/chembldb/index.php"&gt;JPO announced the new release&lt;/a&gt;. FTP data will be available in the next few days.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8942442068905528713-1301283847025743346?l=alchemoinformatics.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://alchemoinformatics.blogspot.com/feeds/1301283847025743346/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://alchemoinformatics.blogspot.com/2010/04/chembl03-is-available.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8942442068905528713/posts/default/1301283847025743346'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8942442068905528713/posts/default/1301283847025743346'/><link rel='alternate' type='text/html' href='http://alchemoinformatics.blogspot.com/2010/04/chembl03-is-available.html' title='ChEMBL_03 is available!'/><author><name>Matteo Floris</name><uri>http://www.blogger.com/profile/13177555699430611910</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='22' height='32' src='http://3.bp.blogspot.com/_eK36skX7Mfk/TBDSbdJfQ4I/AAAAAAAAACw/7kCnO6TT658/S220/matteo.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8942442068905528713.post-7071576353962385661</id><published>2010-03-25T11:26:00.010+01:00</published><updated>2010-03-25T11:49:06.656+01:00</updated><title type='text'>Any other case of "Different InChIs from the Same Molecule"?: an experiment with MMsINC 1.0</title><content type='html'>Some days ago, an InChI bug was highlighted by Rich Apodaca on &lt;a href="http://depth-first.com/articles/2010/03/11/significant-inchi-issue-two-different-inchis-from-the-same-molecule"&gt;his blog&lt;/a&gt;:&lt;br /&gt;&lt;br /&gt;me and Marco Fanton (see &lt;a href="http://mms.dsfarm.unipd.it/mfanton.htm"&gt;Marco Fanton web page&lt;/a&gt;) decided to perform an experiment with the 3M of 3D structures from MMsINC 1.0: we have created a small pure-Python script that is able to reshuffle the atom order of a given SDF file (you can write me if you are interested in the code), then we generated "on-the-fly" 10 random permutations for each of the 3M structures, automatically calculated the standard InChI, and searched for any "new" duplicated InChI.&lt;br /&gt;Results? Interesting:&lt;br /&gt;&lt;br /&gt;two molecules (&lt;a href="http://mms.dsfarm.unipd.it/MMsINC/search/molecule.php?mmscode=MMs03263666"&gt;MMs03263666 &lt;/a&gt;, &lt;a href="http://mms.dsfarm.unipd.it/MMsINC/search/molecule.php?mmscode=MMs03263667"&gt;MMs03263667&lt;/a&gt;) that are di-azo compounds (and this confirms the known bug)... and no other duplicates.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8942442068905528713-7071576353962385661?l=alchemoinformatics.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://alchemoinformatics.blogspot.com/feeds/7071576353962385661/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://alchemoinformatics.blogspot.com/2010/03/any-other-case-of-different-inchis-from.html#comment-form' title='3 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8942442068905528713/posts/default/7071576353962385661'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8942442068905528713/posts/default/7071576353962385661'/><link rel='alternate' type='text/html' href='http://alchemoinformatics.blogspot.com/2010/03/any-other-case-of-different-inchis-from.html' title='Any other case of &quot;Different InChIs from the Same Molecule&quot;?: an experiment with MMsINC 1.0'/><author><name>Matteo Floris</name><uri>http://www.blogger.com/profile/13177555699430611910</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='22' height='32' src='http://3.bp.blogspot.com/_eK36skX7Mfk/TBDSbdJfQ4I/AAAAAAAAACw/7kCnO6TT658/S220/matteo.jpg'/></author><thr:total>3</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8942442068905528713.post-2417572695081698752</id><published>2010-03-10T16:25:00.007+01:00</published><updated>2010-03-10T17:12:29.936+01:00</updated><title type='text'>About MMsINC data and license</title><content type='html'>As a comment of a &lt;a href="http://alchemoinformatics.blogspot.com/2010/02/mmsinc.html#comments"&gt;recent post&lt;/a&gt;, Egon posted two short questions about MMsINC data and license.&lt;br /&gt;I want to thank Egon for the two questions.&lt;br /&gt;&lt;br /&gt;QUESTION 1: &lt;span style="font-weight:bold;"&gt;what's the license of the data in the database&lt;/span&gt;?&lt;br /&gt;&lt;br /&gt;ANSWER: MMsINC data are property of the University of Padova.&lt;br /&gt;Actually, data are not available for download, but the users can query them though the web interface.&lt;br /&gt;&lt;br /&gt;QUESTION 2: &lt;span style="font-weight:bold;"&gt;how does the curation compare to that of ChEBI&lt;/span&gt;?&lt;br /&gt;&lt;br /&gt;ANSWER:&lt;br /&gt;a. the MMsINC data sources are larger than those included in ChEBI: first release contains a number of sources but the most relevant is Zinc (version 7), but the next release aims to process the greater part of public data (Pubchem, mainly).&lt;br /&gt;b. structures are not checked or processed one by one, but by following a precise &lt;a href="http://nar.oxfordjournals.org/cgi/content/abstract/gkn727v1"&gt;protocol described here&lt;/a&gt; (open access NAR paper).&lt;br /&gt;&lt;br /&gt;Some additional notes about the quality of MMsINC data:&lt;br /&gt;&lt;br /&gt;the quality of MMsINC chemical data is higher than other public resources: &lt;br /&gt;&lt;br /&gt;- MMsINC is not just a collection of public data, but there is a long preprocessing work (see the Nucleic Acid Research DBIssue article for a &lt;a href="http://nar.oxfordjournals.org/cgi/content/full/37/suppl_1/D284"&gt;full description of the pipeline&lt;/a&gt;) and a data cleaning based on the InChIs.&lt;br /&gt;- MMsINC is the only resource that collect the most probable ionic states and tautomers of all the structures (when possible and with the known limitation)&lt;br /&gt;- MMsINC stores precalculated predictions of biological enrichment of each molecules (similarity to PDB ligand, to bioactive molecules, presence of "active" fragments)&lt;br /&gt;- MMsINC contains a selection of descriptors important from a pharmaceutical and biochemical point of view&lt;br /&gt;&lt;br /&gt;I want also to cite Stefano (&lt;a href="http://mms.dsfarm.unipd.it/"&gt;Prof. Stefano Moro, University of Padova, Italy&lt;/a&gt;):&lt;br /&gt;&lt;br /&gt;&lt;blockquote&gt;MMsINC is not only a database: it is a chemogenomics work platform that places&lt;br /&gt;its data and tools to work with it on an even footing.  Although the data &lt;br /&gt;formally is property of the University of Padova, it is more important to note&lt;br /&gt;that we feel that simply providing files to download would belittle our&lt;br /&gt;mission.  Instead, we aim to bring this data and the science of chemoinformatics&lt;br /&gt;together to provide the MMsINC service to our web community.&lt;/blockquote&gt;&lt;br /&gt;&lt;br /&gt;Let me thank &lt;a href="http://migweb.crs4.it/system/files/lucas_cv.pdf"&gt;Luca Pireddu&lt;/a&gt; for helping me translate Prof. Moro's quote.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8942442068905528713-2417572695081698752?l=alchemoinformatics.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://alchemoinformatics.blogspot.com/feeds/2417572695081698752/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://alchemoinformatics.blogspot.com/2010/03/about-mmsinc-data-and-license.html#comment-form' title='2 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8942442068905528713/posts/default/2417572695081698752'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8942442068905528713/posts/default/2417572695081698752'/><link rel='alternate' type='text/html' href='http://alchemoinformatics.blogspot.com/2010/03/about-mmsinc-data-and-license.html' title='About MMsINC data and license'/><author><name>Matteo Floris</name><uri>http://www.blogger.com/profile/13177555699430611910</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='22' height='32' src='http://3.bp.blogspot.com/_eK36skX7Mfk/TBDSbdJfQ4I/AAAAAAAAACw/7kCnO6TT658/S220/matteo.jpg'/></author><thr:total>2</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8942442068905528713.post-963562645974215673</id><published>2010-03-06T14:22:00.002+01:00</published><updated>2010-03-06T14:25:17.078+01:00</updated><title type='text'>OOChemistry</title><content type='html'>This is an interesting tool.&lt;br /&gt;I want to report here the message from the developers:&lt;br /&gt;&lt;br /&gt;&lt;blockquote&gt;OOChemistry is an extension for OpenOffice.org which provides cross-platform OLE-like integration of OOo with JChemPaint chemical diagram editor. With OOChemistry you can draw structure, embed into document (text or presentation) and than double click and edit whenever you want on any platform having OpenOffice.org and Java Runtime (Windows, Linux, Mac OS X, other Unix flavours). It is only first alpha version and is not recommended for production use (e.g., compatibility with futher versions is not guaranteed).&lt;br /&gt;&lt;br /&gt;OOChemistry needs your help! Experience in Java, in development of projects dealing with JChemPaint/CDK, or in development of OpenOffice.org extensions will be highly appreciated. Of course, you can help not only in coding, but also in translation of interface and writing docs.&lt;/blockquote&gt;&lt;br /&gt;&lt;br /&gt;&lt;a href="http://sourceforge.net/projects/oochemistry/develop/"&gt;Project page on SF&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;It sounds very interesting.&lt;br /&gt;I'll try the installation asap.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8942442068905528713-963562645974215673?l=alchemoinformatics.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://alchemoinformatics.blogspot.com/feeds/963562645974215673/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://alchemoinformatics.blogspot.com/2010/03/oochemistry.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8942442068905528713/posts/default/963562645974215673'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8942442068905528713/posts/default/963562645974215673'/><link rel='alternate' type='text/html' href='http://alchemoinformatics.blogspot.com/2010/03/oochemistry.html' title='OOChemistry'/><author><name>Matteo Floris</name><uri>http://www.blogger.com/profile/13177555699430611910</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='22' height='32' src='http://3.bp.blogspot.com/_eK36skX7Mfk/TBDSbdJfQ4I/AAAAAAAAACw/7kCnO6TT658/S220/matteo.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8942442068905528713.post-4181202295647445445</id><published>2010-03-05T11:26:00.003+01:00</published><updated>2010-03-05T11:30:59.207+01:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='CDK'/><title type='text'>c-d-k.org</title><content type='html'>Good news: &lt;a href="http://www.chemistry-development-kit.org/"&gt;c-d-k.org&lt;/a&gt; is again available!; it is a good way to be introduced to the CDK functionalities.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8942442068905528713-4181202295647445445?l=alchemoinformatics.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://alchemoinformatics.blogspot.com/feeds/4181202295647445445/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://alchemoinformatics.blogspot.com/2010/03/c-d-korg.html#comment-form' title='7 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8942442068905528713/posts/default/4181202295647445445'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8942442068905528713/posts/default/4181202295647445445'/><link rel='alternate' type='text/html' href='http://alchemoinformatics.blogspot.com/2010/03/c-d-korg.html' title='c-d-k.org'/><author><name>Matteo Floris</name><uri>http://www.blogger.com/profile/13177555699430611910</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='22' height='32' src='http://3.bp.blogspot.com/_eK36skX7Mfk/TBDSbdJfQ4I/AAAAAAAAACw/7kCnO6TT658/S220/matteo.jpg'/></author><thr:total>7</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8942442068905528713.post-646140524775270208</id><published>2010-03-03T10:12:00.002+01:00</published><updated>2010-03-03T10:15:32.462+01:00</updated><title type='text'>About SMARTS patterns in PubChem fingerprints</title><content type='html'>&lt;a href="http://blueobelisk.stackexchange.com/questions/280/pubchem-structural-keys-why-those-smarts-patterns"&gt;See my recent post at BlueObelisk StackExchange&lt;/a&gt;.&lt;br /&gt;Wolf Ihlenfeldt's reply is very interesting for people that want to know more about PubChem fingerprints. &lt;br /&gt;As he says, the SMARTS patterns (terminal part of the FPs)...&lt;br /&gt;&lt;br /&gt;"&lt;span style="font-style:italic;"&gt;are the result of capturing and analyzing user queries on the old NCI Cancer Screening database Web interface. They are intended to capture features which are used in actual queries. They were not designed specifically for similarities or correlation with any properties, but it turns out that there are indications that these screens work about as well as others in that sector&lt;/span&gt;".&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8942442068905528713-646140524775270208?l=alchemoinformatics.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://alchemoinformatics.blogspot.com/feeds/646140524775270208/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://alchemoinformatics.blogspot.com/2010/03/about-smarts-patterns-in-pubchem.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8942442068905528713/posts/default/646140524775270208'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8942442068905528713/posts/default/646140524775270208'/><link rel='alternate' type='text/html' href='http://alchemoinformatics.blogspot.com/2010/03/about-smarts-patterns-in-pubchem.html' title='About SMARTS patterns in PubChem fingerprints'/><author><name>Matteo Floris</name><uri>http://www.blogger.com/profile/13177555699430611910</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='22' height='32' src='http://3.bp.blogspot.com/_eK36skX7Mfk/TBDSbdJfQ4I/AAAAAAAAACw/7kCnO6TT658/S220/matteo.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8942442068905528713.post-716510599297263053</id><published>2010-02-27T13:23:00.008+01:00</published><updated>2010-03-01T13:26:56.798+01:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='rcdk'/><category scheme='http://www.blogger.com/atom/ns#' term='fingerprints'/><category scheme='http://www.blogger.com/atom/ns#' term='Tanimoto'/><category scheme='http://www.blogger.com/atom/ns#' term='clustering'/><category scheme='http://www.blogger.com/atom/ns#' term='Python'/><title type='text'>Clustering with rcdk and Python</title><content type='html'>I received an email from a collegue: "I have a list of ChEBI IDs with the corresponding SMILES; I'd like to do some clustering, based on a similarity measure between the structures".&lt;br /&gt;&lt;br /&gt;I want to propose here a method based on the &lt;a href="cran.r-project.org/web/packages/rcdk/index.html"&gt;&lt;span style="font-style:italic;"&gt;rcdk&lt;/span&gt; package&lt;/a&gt; developed by R. Guha. This package is really nice. You should read carefully &lt;a href="http://cran.r-project.org/web/packages/rcdk/vignettes/rcdk.pdf"&gt;this article&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;Let me suggest a variation. If you have a huge number of structures, I would suggest to create externally from R your matrix of similarities.&lt;br /&gt;&lt;br /&gt;This can be done with a Python script:&lt;br /&gt;1) you can create your structural keys with an external tool (or, with the &lt;span style="font-style:italic;"&gt;rcdk&lt;/span&gt; and save the fingerprints in another file)&lt;br /&gt;2) then, you can calculate the Tanimoto similarity by using functions from the Python &lt;span style="font-style:italic;"&gt;sets&lt;/span&gt; package:&lt;br /&gt;&lt;br /&gt;&lt;pre&gt;&lt;br /&gt;import sys, os&lt;br /&gt;from sets import Set&lt;br /&gt;&lt;br /&gt;fp_A = list("110011")&lt;br /&gt;fp_B = list("101011")&lt;br /&gt;&lt;br /&gt;set_a, set_b = Set([]), Set([])&lt;br /&gt;i = -1&lt;br /&gt;try:&lt;br /&gt; while 1:&lt;br /&gt;  i = fp_A.index("1", i+1)&lt;br /&gt;  set_a.add(i)&lt;br /&gt;except ValueError: pass&lt;br /&gt;i = -1&lt;br /&gt;try:&lt;br /&gt; while 1:&lt;br /&gt;  i = fp_B.index("1", i+1)&lt;br /&gt;  set_b.add(i)&lt;br /&gt;except ValueError: pass&lt;br /&gt;&lt;br /&gt;tanimoto = float( len(set_a.intersection(set_b)) ) / float( len(set_a.union(set_b)) )&lt;br /&gt;&lt;/pre&gt;&lt;br /&gt;&lt;br /&gt;3) in this way, you can calculate the matrix of similarities between all your structures, save the matrix in a file, load it in R environment, and use &lt;span style="font-style:italic;"&gt;rcdk&lt;/span&gt; for the clustering.&lt;br /&gt;&lt;br /&gt;That's all.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8942442068905528713-716510599297263053?l=alchemoinformatics.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://alchemoinformatics.blogspot.com/feeds/716510599297263053/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://alchemoinformatics.blogspot.com/2010/02/how-to-cluster-your-structures.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8942442068905528713/posts/default/716510599297263053'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8942442068905528713/posts/default/716510599297263053'/><link rel='alternate' type='text/html' href='http://alchemoinformatics.blogspot.com/2010/02/how-to-cluster-your-structures.html' title='Clustering with rcdk and Python'/><author><name>Matteo Floris</name><uri>http://www.blogger.com/profile/13177555699430611910</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='22' height='32' src='http://3.bp.blogspot.com/_eK36skX7Mfk/TBDSbdJfQ4I/AAAAAAAAACw/7kCnO6TT658/S220/matteo.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8942442068905528713.post-7841126090525615855</id><published>2010-02-25T13:39:00.002+01:00</published><updated>2010-02-25T13:43:40.266+01:00</updated><title type='text'>Chemoinformatics in R:</title><content type='html'>Really interesting if you want to learn more about R programming applied to chemoinformatics: a Joint EBI-Industry Workshop on &lt;span style="font-weight:bold;"&gt;Cheminformatics in R&lt;/span&gt;.&lt;br /&gt;Speakers of this short course:&lt;br /&gt;- Rajarshi Guha, NIH Chemical Genomics Center (R-CDK and R-Pubchem) &lt;br /&gt;- Steffen Neumann, AG Massenspektrometrie &amp; Bioinformatik ( XCMS, Rdisop, CAMERA) &lt;br /&gt;- H. Paul Benton, Imperial College London. &lt;br /&gt;- David Broadhurst, Cork University Maternity Hospital.&lt;br /&gt;&lt;br /&gt;&lt;a href="http://www.ebi.ac.uk/industry/Workshops/CheminformaticsR170510.html"&gt;Course page&lt;/a&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8942442068905528713-7841126090525615855?l=alchemoinformatics.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://alchemoinformatics.blogspot.com/feeds/7841126090525615855/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://alchemoinformatics.blogspot.com/2010/02/chemoinformatics-in-r.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8942442068905528713/posts/default/7841126090525615855'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8942442068905528713/posts/default/7841126090525615855'/><link rel='alternate' type='text/html' href='http://alchemoinformatics.blogspot.com/2010/02/chemoinformatics-in-r.html' title='Chemoinformatics in R:'/><author><name>Matteo Floris</name><uri>http://www.blogger.com/profile/13177555699430611910</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='22' height='32' src='http://3.bp.blogspot.com/_eK36skX7Mfk/TBDSbdJfQ4I/AAAAAAAAACw/7kCnO6TT658/S220/matteo.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8942442068905528713.post-7714448263507554849</id><published>2010-02-24T09:45:00.002+01:00</published><updated>2010-02-24T11:26:30.326+01:00</updated><title type='text'>New data load for kinase SARfari screening data</title><content type='html'>As JPO reported on the ChEMBL Blog, there is a new data load for the beta version of Kinase SARfari.&lt;br /&gt;&lt;br /&gt;&lt;a href="http://chembl.blogspot.com/2010/02/beta-testing-new-version-of-kinase.html"&gt;See the post at ChEMBL blog&lt;/a&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8942442068905528713-7714448263507554849?l=alchemoinformatics.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://alchemoinformatics.blogspot.com/feeds/7714448263507554849/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://alchemoinformatics.blogspot.com/2010/02/new-data-load-for-kinase-sarfari.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8942442068905528713/posts/default/7714448263507554849'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8942442068905528713/posts/default/7714448263507554849'/><link rel='alternate' type='text/html' href='http://alchemoinformatics.blogspot.com/2010/02/new-data-load-for-kinase-sarfari.html' title='New data load for kinase SARfari screening data'/><author><name>Matteo Floris</name><uri>http://www.blogger.com/profile/13177555699430611910</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='22' height='32' src='http://3.bp.blogspot.com/_eK36skX7Mfk/TBDSbdJfQ4I/AAAAAAAAACw/7kCnO6TT658/S220/matteo.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8942442068905528713.post-2443394801388073203</id><published>2010-02-18T12:28:00.001+01:00</published><updated>2010-02-18T12:29:56.829+01:00</updated><title type='text'>MolPaint</title><content type='html'>&lt;a href="http://mms.dsfarm.unipd.it/mmsinc/molpaint/"&gt;molpaint&lt;/a&gt; - the MMsINC molecular paint tool, based on &lt;a href="http://opensource.scitouch.net/indigo/dingo"&gt;DINGO&lt;/a&gt;.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8942442068905528713-2443394801388073203?l=alchemoinformatics.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://alchemoinformatics.blogspot.com/feeds/2443394801388073203/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://alchemoinformatics.blogspot.com/2010/02/molpaint.html#comment-form' title='3 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8942442068905528713/posts/default/2443394801388073203'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8942442068905528713/posts/default/2443394801388073203'/><link rel='alternate' type='text/html' href='http://alchemoinformatics.blogspot.com/2010/02/molpaint.html' title='MolPaint'/><author><name>Matteo Floris</name><uri>http://www.blogger.com/profile/13177555699430611910</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='22' height='32' src='http://3.bp.blogspot.com/_eK36skX7Mfk/TBDSbdJfQ4I/AAAAAAAAACw/7kCnO6TT658/S220/matteo.jpg'/></author><thr:total>3</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8942442068905528713.post-8564110759306030034</id><published>2010-02-09T14:39:00.006+01:00</published><updated>2010-02-10T10:59:41.731+01:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='descriptors'/><title type='text'>Looking for Open Source Chemical Descriptors</title><content type='html'>Just asked a question about this point on the &lt;a href="http://blueobelisk.stackexchange.com/questions/214/what-are-the-best-free-tools-for-molecular-descriptors-calculation"&gt;Blue Obelisk Exchange&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;With RCDK is very easy to get a good number of descriptors with few rows of code:&lt;br /&gt;&lt;br /&gt;&lt;pre&gt;&lt;br /&gt;library(rcdk)&lt;br /&gt;dn &lt;- get.desc.names(dc[1])&lt;br /&gt;mol &lt;- parse.smiles("c1ccccc1")&lt;br /&gt;descNames &lt;- unique(unlist(sapply(get.desc.categories(), get.desc.names)))&lt;br /&gt;descs &lt;- eval.desc(mol, descNames)&lt;br /&gt;&lt;/pre&gt;&lt;br /&gt;&lt;br /&gt;Nice!, 288 descriptors (but 65 of them are "NA").&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8942442068905528713-8564110759306030034?l=alchemoinformatics.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://alchemoinformatics.blogspot.com/feeds/8564110759306030034/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://alchemoinformatics.blogspot.com/2010/02/looking-for-open-source-chemical.html#comment-form' title='2 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8942442068905528713/posts/default/8564110759306030034'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8942442068905528713/posts/default/8564110759306030034'/><link rel='alternate' type='text/html' href='http://alchemoinformatics.blogspot.com/2010/02/looking-for-open-source-chemical.html' title='Looking for Open Source Chemical Descriptors'/><author><name>Matteo Floris</name><uri>http://www.blogger.com/profile/13177555699430611910</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='22' height='32' src='http://3.bp.blogspot.com/_eK36skX7Mfk/TBDSbdJfQ4I/AAAAAAAAACw/7kCnO6TT658/S220/matteo.jpg'/></author><thr:total>2</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8942442068905528713.post-8515672705544970278</id><published>2010-02-09T13:51:00.005+01:00</published><updated>2010-02-09T13:58:26.217+01:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='substructure search'/><category scheme='http://www.blogger.com/atom/ns#' term='MMsINC'/><category scheme='http://www.blogger.com/atom/ns#' term='fragmentation'/><category scheme='http://www.blogger.com/atom/ns#' term='database'/><title type='text'>Database Indexing for a faster Substructure Search</title><content type='html'>In our first release of &lt;a href="http://mms.dsfarm.unipd.it/MMsINC/search/"&gt;MMsINC&lt;/a&gt; (see &lt;a href="http://alchemoinformatics.blogspot.com/2010/02/mmsinc.html"&gt;recent post&lt;/a&gt;) we developed a strategy that can help in making faster a substructure search in huge databases.&lt;br /&gt;&lt;br /&gt;Rules:&lt;br /&gt;1) choose a fragmentation algorithm&lt;br /&gt;2) fragment all the compounds in your database&lt;br /&gt;3) store the fragments in your database&lt;br /&gt;&lt;br /&gt;Then, when the user submits a query, you can apply your fragmentation tool to the query compoud.&lt;br /&gt;If you are lucky, you can restrict the search space to the database compounds that share the same fragments of the query.&lt;br /&gt;You can then apply the exact substructure search to this reduced set of compounds.&lt;br /&gt;&lt;br /&gt;You need more disk space, but in this way you can save computation time.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8942442068905528713-8515672705544970278?l=alchemoinformatics.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://alchemoinformatics.blogspot.com/feeds/8515672705544970278/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://alchemoinformatics.blogspot.com/2010/02/database-indexing-for-faster.html#comment-form' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8942442068905528713/posts/default/8515672705544970278'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8942442068905528713/posts/default/8515672705544970278'/><link rel='alternate' type='text/html' href='http://alchemoinformatics.blogspot.com/2010/02/database-indexing-for-faster.html' title='Database Indexing for a faster Substructure Search'/><author><name>Matteo Floris</name><uri>http://www.blogger.com/profile/13177555699430611910</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='22' height='32' src='http://3.bp.blogspot.com/_eK36skX7Mfk/TBDSbdJfQ4I/AAAAAAAAACw/7kCnO6TT658/S220/matteo.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8942442068905528713.post-9157774816661561342</id><published>2010-02-08T10:43:00.001+01:00</published><updated>2010-02-08T10:44:53.395+01:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='chemoinformatics'/><category scheme='http://www.blogger.com/atom/ns#' term='MMsINC'/><category scheme='http://www.blogger.com/atom/ns#' term='virtual screening'/><category scheme='http://www.blogger.com/atom/ns#' term='database'/><title type='text'>MMsINC</title><content type='html'>What is MMsINC?&lt;br /&gt;&lt;br /&gt;MMsINC is a database of non-redundant, richly annotated, and biomedically relevant chemical structures. &lt;br /&gt;&lt;br /&gt;A primary goal of MMsINC is to guarantee the highest quality and the uniqueness of each entry. MMsINC then adds value to these entries by including the analysis of crucial chemical properties such as ionization and tautomerization processes, and the in silico prediction of 24 important molecular properties in the biochemical profile of each structure. MMsINC is consequently a natural input for different chemoinformatics and virtual screening applications. In addition, MMsINC supports various types of queries, including substructure queries and the novel "molecular scissoring" query. &lt;br /&gt;&lt;br /&gt;MMsINC is interfaced with other primary data collectors such as PubChem, Protein Data Bank (PDB), the Food and Drug Administration (FDA) database of approved drugs, and ZINC.&lt;br /&gt;&lt;br /&gt;The current database contains about 4 million unique compounds. For all the molecules, we calculate 24 molecular properties useful for quantitative structure-activity relationship (QSAR), diversity analysis or combinatorial library design.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8942442068905528713-9157774816661561342?l=alchemoinformatics.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='related' href='http://mms.dsfarm.unipd.it/MMsINC/search/' title='MMsINC'/><link rel='replies' type='application/atom+xml' href='http://alchemoinformatics.blogspot.com/feeds/9157774816661561342/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://alchemoinformatics.blogspot.com/2010/02/mmsinc.html#comment-form' title='2 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8942442068905528713/posts/default/9157774816661561342'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8942442068905528713/posts/default/9157774816661561342'/><link rel='alternate' type='text/html' href='http://alchemoinformatics.blogspot.com/2010/02/mmsinc.html' title='MMsINC'/><author><name>Matteo Floris</name><uri>http://www.blogger.com/profile/13177555699430611910</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='22' height='32' src='http://3.bp.blogspot.com/_eK36skX7Mfk/TBDSbdJfQ4I/AAAAAAAAACw/7kCnO6TT658/S220/matteo.jpg'/></author><thr:total>2</thr:total></entry></feed>
