A coalition of pharmaceutical companies has pooled thousands of experimentally determined protein-ligand structural data to train an advanced large language model, aiming to boost the power and efficiency of AI in drug discovery. This collaborative initiative enhances chemical diversity in training sets, potentially revolutionizing molecular design and expediting novel therapeutic developments.