Reach new data frontiers with HeLx
- Provides a complete data science laboratory for research in the cloud.
- Solves technical infrastructure challenges to focus researchers on science.
- Brings the right tools for each team together in a secure, scalable portal.
- Includes bring your own data access patterns.
Implement tools and solutions to explore your data
- Existing applications include notebooks, imaging, Apache Spark, and Nextflow workflows.
- Includes intuitive full text driven, knowledge graph augmented semantic search.
- Open source technology supports multiple authentication providers and authorization models.
- Scalable cloud native architecture with Kubernetes, Apache Airflow, and Nextflow.
Launch a flexible workspace for any science domain
- Provides a unified data science workbench tailored to each science community.
- Extensible metadata driven architecture simplifies adding data science workspaces.
- Applicable to and actively used across science domains.

The consortium’s semantic search is powered by HeLx. It has indexed TOPMed phenotypic concepts and dbGaP study variables for all studies in BioData Catalyst. The full text search also uses ontologies and controlled vocabularies to discover connections between elements of data sets, performing a form of automated curation.
For Scientific Discovery at Scale (SciDAS), HeLx executes the Nextflow workflow engine as an app in Kubernetes to drive highly parallel, GPU intensive workflows in Kubernetes. The system takes full advantage of the cloud by dynamically scaling costly GPU resources according to demand.

Blackbalsam is an open source data science environment with an initial focus on COVID-19 and North Carolina. Blackbalsam's interface uses a Jupyter notebook environment featuring artificial intelligence, visualization, and scalable computing capabilities.
BRAIN-I is a computational infrastructure for handling huge images combined with a discovery environment where scientists can run applications and do their analysis. It deals with big data and computation in a user-friendly way so scientists can concentrate on their science.