Abstract
Deep neural networks (DNNs) based quantitative structure–property relationship (QSPR) studies are receiving increasing attention due to their excellent performances. A systematic methodology coupling multiple machine learning technologies is proposed to systematically solve vital problems including applicability domain and prediction uncertainty in DNN-based QSPR modeling. Key features are rapidly extracted from plentiful but chaotic descriptors by principal component analysis (PCA) and kernel PCA. Then, a detailed applicability domain (AD) is defined by K-means algorithm to avoid unreliable predictions and discover its potential impact on prediction uncertainty. Moreover, prediction uncertainty is analyzed with dropout-embedded DNN by thousands of independent tests to assess the reliability of predictions. The prediction of flashpoint temperature is employed as a case study, demonstrating that the model accuracy is remarkably improved comparing with the referenced model. Furthermore, the proposed methodology breaks through difficulties in analyzing the uncertainty of DNN-based QSPRs and presents an AD correlated with the uncertainty.
| Original language | English |
|---|---|
| Article number | e17402 |
| Number of pages | 15 |
| Journal | AICHE Journal |
| Volume | 68 |
| Issue number | 1 |
| DOIs | |
| Publication status | Published - Jan 2022 |
Keywords
- deep neural network
- flashpoint
- principal component analysis
- QSPR
- uncertainty analysis
ASJC Scopus subject areas
- Biotechnology
- Environmental Engineering
- General Chemical Engineering
Fingerprint
Dive into the research topics of 'A systematic modeling methodology of deep neural network-based structure-property relationship for rapid and reliable prediction on flashpoints'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver