Crowdsourcing-based natural products database and knowledge discovery system
Abstract: Existing natural products databases are updated slowly and hold too little data. To address these problems, a crowdsourcing-based natural products database and knowledge discovery system was developed. The system uses crowdsourcing to build the natural products database, encodes molecular structures as molecular fingerprints, and computes similarity with the Tanimoto coefficient to retrieve natural products and related literature. The database can be expanded in real time, and the system gives biologists a reference for understanding current research hotspots and determining directions for further research.
Key words:
- crowdsourcing
- natural product database
- knowledge discovery system
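
The retrieval step summarized in the abstract (fingerprint encoding followed by Tanimoto similarity) can be illustrated with a minimal sketch. It assumes each fingerprint is represented as the set of its on-bit positions; the abstract does not say which fingerprint type or similarity cutoff the system uses, so the bit sets, compound names, the `search` helper, and the 0.5 threshold below are all hypothetical.

```python
from typing import Dict, FrozenSet, List, Tuple

# Assumption: a fingerprint is stored as the set of its on-bit positions.
Fingerprint = FrozenSet[int]


def tanimoto(a: Fingerprint, b: Fingerprint) -> float:
    """Tanimoto coefficient c / (|a| + |b| - c), where c counts shared on bits."""
    common = len(a & b)
    union = len(a) + len(b) - common
    # Convention chosen for this sketch: two empty fingerprints score 0.0.
    return common / union if union else 0.0


def search(query: Fingerprint,
           library: Dict[str, Fingerprint],
           threshold: float = 0.5) -> List[Tuple[str, float]]:
    """Return (name, score) pairs at or above the threshold, best match first."""
    scored = [(name, tanimoto(query, fp)) for name, fp in library.items()]
    hits = [hit for hit in scored if hit[1] >= threshold]
    return sorted(hits, key=lambda hit: hit[1], reverse=True)


# Made-up fingerprints, not derived from real molecules.
library = {
    "compound_a": frozenset({1, 4, 7, 9}),
    "compound_b": frozenset({1, 4, 8}),
    "compound_c": frozenset({2, 3, 5}),
}
query = frozenset({1, 4, 7})
results = search(query, library)
print(results)
```

In a real deployment the bit sets would come from a cheminformatics toolkit rather than being written by hand, and the threshold would be tuned to the fingerprint type in use.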
