The ID4 All Hands meeting was held at the Colorado School of Mines on October 13-14, 2025. The MRC was represented by Kio Polson, Scott McClellan, Xintong Zhao, and Dave Breen.
Scott McClellan and Colton Gerber (Toberer Group) presented on MatSci-YAMZ, an LLM-augmented version of Yet Another Metadata Zoo (YAMZ). MatSci-YAMZ is dedicated to studying vocabulary development in the materials science community and investigating the role large language models might play. Their presentation engaged audience members to test the new system by entering terms and definitions as well as commenting and voting on definitions produced by the LLM. Slides from their presentation can be seen here.
Prof. David Breen gave the presentation “AI-Ready Data: Knowledge Extraction from Chemistry Lab Notebooks’. The talk summarized the MRC’s research on converting hand-written chemistry lab notebooks into a structured digital form, making their data AI-ready, i.e. amenable for downstream analysis and model training. The three steps in the conversion process are: 1) automatic segmentation of the notebook pages’ components, 2) extraction of structured data from the components, and 3) error analysis and correction of the data. Slides from Dr. Breen’s presentation can be found here.


