From its base near Cambridge, EMBL’s European Bioinformatics Institute has become a critical part of global life sciences. Its story is one of open data, international collaboration and careful stewardship, with today’s challenges centred on scale, artificial intelligence, talent and the long-term resilience of shared scientific infrastructure for everyone globally.
EMBL’s European Bioinformatics Institute, widely known as EMBL-EBI, occupies a distinctive place in the life sciences economy. Based at Hinxton, near Cambridge, it is one of the six sites of the European Molecular Biology Laboratory, an intergovernmental organisation established by international treaty rather than by the European Union. EMBL-EBI itself was created in the 1990s to give Europe a dedicated home for bioinformatics, at a time when biological research was beginning to generate data at a speed traditional publishing and laboratory practice could not easily manage. Its roots reach into earlier efforts to collect and share nucleotide sequence data, but its growth has mirrored the wider rise of genomics, computational biology and data-driven discovery. What began as a specialist scientific service has become a central piece of global research infrastructure. Today, EMBL-EBI maintains major open data resources and analysis tools used by researchers, clinicians, technology companies and public bodies. Its purpose remains straightforward: to help scientists realise the potential of big data in biology and to make high-quality biological information available for reuse.
The institute’s history is also a history of openness as a practical operating model. EMBL-EBI provides freely available and up-to-date molecular data resources across areas including genomes, nucleotide sequences, proteins, gene expression, small molecules, pathways, diseases, samples and ontologies. Resources such as the GWAS Catalog, the International Genome Sample Resource, AlphaFold DB, UniProt services and many others support work that ranges from basic laboratory research to clinical interpretation and industrial development. The scale of this contribution is not simply academic. An independent report by Frontier Economics found that EMBL-EBI’s open data resources generate multibillion-pound value each year, including an estimated £11.8 billion in annual productivity gains across public and private sectors. The same work reported benefits more than 100 times higher than the cost of maintaining the resources, while many users said EMBL-EBI enables research that would otherwise be impossible or significantly slower. For business readers, that is a powerful reminder that shared infrastructure can create measurable value far beyond the organisation that hosts it.
The pressures facing EMBL-EBI are the pressures facing the wider life sciences industry, only at greater intensity. Biological data is growing rapidly as sequencing, imaging, proteomics and population-scale health studies become more accessible. At the same time, artificial intelligence is changing expectations about what biological databases can do. High-quality, well-curated and machine-readable data is essential for training new tools, but speed cannot replace trust. EMBL-EBI’s response has been to combine open science with data stewardship, expert curation and long-term service continuity. Its teams enhance data through annotation, integration with scientific literature and quality control, while also exploring text mining, machine learning, large language models and other AI techniques where they can improve scale and reliability. The AlphaFold story illustrates the point clearly. Open data stored at EMBL-EBI played an important role in the development of an AI system that transformed protein structure prediction. The lesson is that AI breakthroughs depend not only on algorithms, but also on dependable public data resources built over many years.
Another current challenge is people. EMBL-EBI describes science as international, and more than three quarters of its workforce has joined from outside the UK. That matters because bioinformatics sits at the intersection of biology, computing, statistics, infrastructure engineering and service delivery. The UK’s departure from the European Union could have created uncertainty for recruitment, funding and collaboration, but EMBL-EBI’s status provides a degree of continuity. As part of EMBL, it is not formally linked to the European Union, and the UK’s membership of EMBL is independent of EU membership. The institute continues to recruit internationally, supports staff and families moving to the UK, and can help eligible employees apply for an exempt vignette. This practical support is more than a human resources detail; it is part of maintaining a world-class scientific service. In an industry where talent is scarce and mobility matters, the ability to attract experienced people from across the world is central to resilience. EMBL-EBI’s employment model reflects its scientific model: international, collaborative and designed for long horizons.
EMBL-EBI’s history shows how patient investment in shared scientific infrastructure creates lasting public value worldwide. Its response to today’s pressures combines open access, careful stewardship, and practical support for researchers. For businesses, the lesson is that trusted data services can strengthen entire innovation ecosystems globally. As AI changes life sciences, EMBL-EBI is helping ensure progress remains accountable and reusable widely. Its continuing challenge is preserving openness while meeting rising demand, technical complexity, and public expectations.




