Chemical synthesis is critical across life sciences, materials, and energy, yet conventional experimentation remains labor‑intensive. This work introduces Chemma, a fully fine‑tuned large language model trained on 1.28 million Q&A pairs about chemical reactions. Chemma surpasses existing benchmarks in tasks such as single‑step retrosynthesis and yield prediction, demonstrating that gene…
Chemical synthesis is foundational across fields like pharmaceuticals, materials, and energy, yet current workflows often rely on repetitive trial-and-error, making them time-consuming and resource-intensive. To overcome these limitations, modern large language models (LLMs), exemplified by GPT‑4, have been fine‑tuned on millions of domain-specific Q&A pairs, creating a specialized assistan…