Форма представления | Статьи в зарубежных журналах и сборниках |
Год публикации | 2022 |
Язык | английский |
|
Маджидов Тимур Исмаилович, автор
|
Библиографическое описание на языке оригинала |
Akhmetshin T. HyFactor: A Novel Open-Source, Graph-Based Architecture for Chemical Structure Generation / Akhmetshin T., Lin A., Mazitov D., Zabolotna Yu., Ziaikin E., Madzhidov T., Varnek A. // Journal of Chemical Information and Modeling. - 2022. - Vol. 62, Is. 15. - P. 3524-3534. |
Аннотация |
Graph-based architectures are becoming increasingly popular as a tool for structure generation. Here, we introduce novel open-source architecture HyFactor in which, similar to the InChI linear notation, the number of hydrogens attached to the heavy atoms was considered instead of the bond types. HyFactor was benchmarked on the ZINC 250K, MOSES, and ChEMBL data sets against conventional graph-based architecture ReFactor, representing our implementation of the reported DEFactor architecture in the literature. On average, HyFactor models contain some 20% less fitting parameters than those of ReFactor. The two architectures display similar validity, uniqueness, and reconstruction rates. Compared to the training set compounds, HyFactor generates more similar structures than ReFactor. This could be explained by the fact that the latter generates many open-chain analogues of cyclic structures in the training set. It has been demonstrated that the reconstruction error of heavy molecules can be significantly reduced using the data augmentation technique. The codes of HyFactor and ReFactor as well as all models obtained in this study are publicly available from our GitHub repository: https://github.com/Laboratoire-de-Chemoinformatique/HyFactor. |
Ключевые слова |
Chemical structure,Embedding,Layers,Molecular structure,Molecules |
Название журнала |
Journal of Chemical Information and Modeling
|
URL |
https://pubs.acs.org/doi/10.1021/acs.jcim.2c00744 |
Пожалуйста, используйте этот идентификатор, чтобы цитировать или ссылаться на эту карточку |
https://repository.kpfu.ru/?p_id=281120 |
Полная запись метаданных |
Поле DC |
Значение |
Язык |
dc.contributor.author |
Маджидов Тимур Исмаилович |
ru_RU |
dc.date.accessioned |
2022-01-01T00:00:00Z |
ru_RU |
dc.date.available |
2022-01-01T00:00:00Z |
ru_RU |
dc.date.issued |
2022 |
ru_RU |
dc.identifier.citation |
Akhmetshin T. HyFactor: A Novel Open-Source, Graph-Based Architecture for Chemical Structure Generation / Akhmetshin T., Lin A., Mazitov D., Zabolotna Yu., Ziaikin E., Madzhidov T., Varnek A. // Journal of Chemical Information and Modeling. - 2022. - Vol. 62, Is. 15. - P. 3524-3534. |
ru_RU |
dc.identifier.uri |
https://repository.kpfu.ru/?p_id=281120 |
ru_RU |
dc.description.abstract |
Journal of Chemical Information and Modeling |
ru_RU |
dc.description.abstract |
Graph-based architectures are becoming increasingly popular as a tool for structure generation. Here, we introduce novel open-source architecture HyFactor in which, similar to the InChI linear notation, the number of hydrogens attached to the heavy atoms was considered instead of the bond types. HyFactor was benchmarked on the ZINC 250K, MOSES, and ChEMBL data sets against conventional graph-based architecture ReFactor, representing our implementation of the reported DEFactor architecture in the literature. On average, HyFactor models contain some 20% less fitting parameters than those of ReFactor. The two architectures display similar validity, uniqueness, and reconstruction rates. Compared to the training set compounds, HyFactor generates more similar structures than ReFactor. This could be explained by the fact that the latter generates many open-chain analogues of cyclic structures in the training set. It has been demonstrated that the reconstruction error of heavy molecules can be significantly reduced using the data augmentation technique. The codes of HyFactor and ReFactor as well as all models obtained in this study are publicly available from our GitHub repository: https://github.com/Laboratoire-de-Chemoinformatique/HyFactor. |
ru_RU |
dc.language.iso |
ru |
ru_RU |
dc.subject |
Chemical structure |
ru_RU |
dc.subject |
Embedding |
ru_RU |
dc.subject |
Layers |
ru_RU |
dc.subject |
Molecular structure |
ru_RU |
dc.subject |
Molecules |
ru_RU |
dc.title |
HyFactor: A Novel Open-Source, Graph-Based Architecture for Chemical Structure Generation |
ru_RU |
dc.type |
Статьи в зарубежных журналах и сборниках |
ru_RU |
|