Training a large language model to do scientific data extraction for a corpus of materials properties

A wealth of experimental and simulation results is contained in the scientific literature. Extracting these results in a machine-readable format is daunting, as it typically requires domain expertise, rigid schemas, or intensive labor. Recently, Dunn et al....


This is the Journal of Brief Ideas - citable ideas in fewer than 200 words.
