Abstract
Artificial intelligence (AI) plays a significant role in advancing polymer science and engineering. Considering the critical role of the glass transition temperature (Tg) in determining the physical properties of polymers, this study systematically investigates the influence of their composition and sequence structure on Tgusing machine learning (ML) models. To clarify the complex relationship between polymer composition and Tg, the k-nearest neighbor mega-trend diffusion (kNNMTD) method was employed for data augmentation, and various ML models were constructed for Tgprediction. Among them, the Random Forest model demonstrated the best performance for the generated data, achieving an R2of 0.85 and an RMSE of 0.38. To explore the effect of polymer sequence structure on Tg, we further introduced natural language processing (NLP) techniques to represent polymer sequences. The data was augmented using the Wasserstein generative adversarial network (GAN) with gradient penalty (WGAN-GP) model, and Tgpredictions were made using a convolutional neural network-long short-term memory (CNN-LSTM) model. This integrated framework achieved excellent predictive performance, with an R2of 0.95 and an RMSE of 0.23, and demonstrated strong generalization across different data sets. In summary, this study introduces an innovative application of kNNMTD for augmenting polymer composition data combined with NLP techniques for representing polymer sequences. The proposed ML framework offers a valuable contribution to the advancement of polymer material design and optimization.
| Original language | English |
|---|---|
| Pages (from-to) | 7478-7492 |
| Number of pages | 15 |
| Journal | Journal of Chemical Information and Modeling |
| Volume | 65 |
| Issue number | 14 |
| DOIs | |
| State | Published - 28 Jul 2025 |
| Externally published | Yes |
Fingerprint
Dive into the research topics of 'Developing Hybrid Machine Learning Frameworks for Polymer Property Prediction Based on Composition and Sequence Features'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver