Migrating clinical workflows to Nextflow at a national scale

Genomics England, a leading provider of whole-genome sequencing diagnostics for the UK’s Genomic Medicine Service, aimed to scaleoperations to process 300,000 samples by 2025. To support this growth, they needed to transition from their internal workflow engine (Bertha) to a more scalable and efficient solution—Genie—built on Nextflow and the Sequera Platform.

Challenge

The migration required preserving the integrity of genomic diagnostics while ensuring automation, scalability, and efficiency. The transition needed to be seamless, with minimal disruption to existing workflows supporting cancer, rare disease, and population-scale studies.

Approach

The project followed a structured migration strategy:

  • A “Lift and Shift” approach was adopted to ensure a smooth transition while maintaining patient benefits.
  • Agile methodology with bi-weekly sprints and continuous stakeholder feedback enabled iterative improvements.
  • The Jasmine testing framework and containerized comparison tests ensured workflow reliability.

Results

  • Regression testing time was reduced from days to minutes.
  • Successful migration to Nextflow, enhancing automation and orchestration of bioinformatics workflows.
  • Improved efficiency in processing a growing volume of sequencing data.

3 key programmes

8+ workflows

9+ bioinformatics components

  • Cancer
  • Rare disease  
  • Generation study

Along with its  automation and orchestration

Conclusion

By leveraging a combination of off-the-shelf and customized bioinformatics solutions, Genomics England successfully modernized its genomic workflow infrastructure. The transition to Nextflow provided a scalable, high-performance environment, positioning the organization to meet its ambitious sequencing goals.

Expert Contribution

Reviewed by: Dr. Kamil Malisz, PhD
Role: Lead Workflows Developer and Solution Architect, AI‑Driven Drug Discovery
Expertise: Bioinformatics pipeline development, Nextflow workflow engineering, phage display and whole-genome sequencing, life sciences software engineering, project and team leadership.

Want to achieve similar efficiency gains? Let's discuss how we can optimize your bioinformatics workflows.

You might be also interested in:

Blog cover for Ardigen publication on ARDisplay-I and MHC ligand identification in Molecular & Cellular Proteomics
New publication in MCP: Improving MHC ligand identification with machine learning and optimized isolation
Fluorescence microscopy image of cells stained with multiple Cell Painting dyes showing cellular organelles in green, blue, and pink, overlaid with Ardigen brand graphic elements indicating phenomics data in durg discovery
End to End Data-to-Decision Journey for AI-Driven Phenomics in Drug Discovery
Abstract network visualization representing AI-driven integration of biological data and knowledge graphs for target identification in drug discovery.
Target Identification: From Poor Data to Quality Predictions
Abstract data streams representing data sourcing in pharmaceutical research and AI drug discovery
What Are Common Data Sourcing Patterns in Pharmaceutical Research (part 3)

Contact

Ready to transform drug discovery?

Discover how one of the top AI CROs in the world, can be your trusted partner in revolutionizing drug discovery through AI.

Contact us today to learn more about our tailored solutions for empowering your drug development journey.

Send us a message and we will contact you back within 48 hours.

Newsletter

Become an insider

Be the first to know about Ardigen’s latest news and get access to our publications, webinars and more!