High-content screening (HSC) assays such as Cell Painting represent a rich data source for phenotypic drug discovery. Machine learning (ML) and Artificial Intelligence (AI) methods for image-based profiling are enabling a new era in phenotypic drug discovery. In particular, introducing ML into your workflows enhances the quality of prediction through multimodal approaches. You can read more about this in our previous article: High Content Screening: Redefining What Is Possible with Artificial Intelligence and Machine Learning.
Human-defined morphological features, such as size, shape, intensity, granularity, and texture, can be extracted from HCS data by using image analysis tools such as CellProfiler1. However, this can take hundreds of hours and may not be able to extract all the insights packed in the images. ML methods not only automate image analysis but also enable feature extraction that extends beyond human-defined morphological features.
In addition to the phenotypic features, chemical features of the tested compounds also provide a valuable source of data for predicting potential drug candidates. To empower researchers and accelerate phenotypic drug discovery, we have developed the Ardigen phenAID platform, a solution that introduces multimodal predictive capabilities to HSC image analysis.
Analyzing phenotypic and chemical features with Ardigen phenAID
Ardigen phenAID is an AI-powered platform that combines multiple modalities, such as image and chemical structure data, to enhance the quality of predictions. PhenAID enables automated extraction of diverse morphological features, together with the classification and clustering of cellular phenotypes to help identify potential drug candidates. In addition to extracting phenotypic insights from HCS data, phenAID incorporates information about the chemical structure of the screened compounds.
This multimodal approach provides powerful insights into the Mode of Action (MoA) and bioactivity property predictions. To show its effectiveness, we benchmarked this method using a validated, proprietary dataset from Merck containing thousands of compounds.
Ardigen phenAID platform applied to a proprietary dataset from Merck KGaA

Multimodal approach boosts the accuracy of prediction

Works Cited:
- Stirling, D.R., et al. (2021). “CellProfiler 4: improvements in speed, utility and usability”, BMC Bioinformatics 22, 433. DOI: 10.1186/s12859-021-04344-9.
- Rumetshofer, E., et al. (2018). “Human-level protein localization with convolutional neural networks,” International conference on learning representations.
- Maziarka, Ł. et al. (2024). “Relative molecule self-attention transformer,” J Cheminform 16, 3. DOI: 10.1186/s13321-023-00789-7
- Rogers, D., and Hahn, M. (2010). “Extended-Connectivity Fingerprints”, J. Chem. Inf. Model. 50, 5, 742–754 DOI: 10.1021/ci100050t