Introduction
The HEAL program includes hundreds of projects generating highly diverse data sets.
RTI and RENCI, the HEAL Stewards, will:
- Use semantic knowledge graphs to link data from disparate studies and facilitate analysis.
- Perform preliminary harmonization of data sets toward the HEAL Common Data Elements (CDEs).
- Provide user-friendly interfaces with a biological lens on the data.
HEAL Harmonization
To accomplish this:
- Ingest HEAL CDEs
- Create (or locate) machine-readable versions of the HEAL CDEs @gaurav
- Discuss with NIH how to publish machine-readable HEAL CDEs @gaurav
- Annotate with controlled vocabulary and ontology identifiers @gaurav
- Map the provided NCI Metathesaurus IDs to Human Phenotype Ontology (etc.) identifiers @gaurav
- Annotate HEAL CDEs using the SciGraph Named Entity Recognition (NER) service in Monarch's BioLink API @gaurav
- Look into alternate NER tools for finding terms in HEAL CDEs (https://github.com/helxplatform/development/issues/804) @gaurav
- Convert to Biolink- and KGX-compliant artifacts @gaurav
- Apply the SRI Normalizer to use Translator-preferred identifiers @YaphetKG @gaurav
- Clean up SciGraph annotations and resend cleaned KGX files to Yaphet @gaurav
- Optimize TranQL queries to take advantage of RedisGraph performance @YaphetKG
- Create new harmonization and translational TranQL queries to link variables through phenotypes, chemicals, and diseases to CDEs @YaphetKG
- Index, recording harmonization connections to enable display
- Ingest HEAL study data as it becomes available @waTeim
- Begin with:
- The SPARC knowledge graph. @HowardLander
- NIDA (as transformed/provided by Comp 2) @warrenstephens
- Create a Dug parser for each data format.
- Update the HeLx/Dug UI to: @mbwatson
- Render:
- An "All" tab for generic search results including lexical matches.
- A "Harmonized" tab for everything else with markers for CDE, PhenX, Biolink, and other groupings.
- Allow deployment with or without:
- Authentication
- App Workspaces
- Present an information-dense display that minimizes paging and scrolling
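To make the conversion and linking steps above more concrete, here is a minimal sketch of how annotated CDEs might be turned into KGX-style node and edge records and then used to find which CDEs share a concept. The record layout (`nodes` with `id`, `name`, `category`; `edges` with `subject`, `predicate`, `object`) follows the general shape of KGX JSON. Everything else is an assumption made for illustration: the helper names, the `HEALCDE:` and `EXAMPLE:` prefixes, the sample CDE names, and the choice of `biolink:NamedThing` as the CDE category and `biolink:related_to` as the predicate. It is not the pipeline actually used by the HEAL Stewards.

```python
from collections import defaultdict


def cde_to_kgx(cde_id, cde_name, annotations):
    """Build KGX-style nodes and edges for one CDE and its annotated concepts.

    `annotations` is a list of (curie, label, biolink_category) tuples,
    for example the output of an NER step (hypothetical input shape).
    """
    # Assumption: CDEs are modeled as generic biolink:NamedThing nodes.
    nodes = [{"id": cde_id, "name": cde_name, "category": ["biolink:NamedThing"]}]
    edges = []
    for curie, label, category in annotations:
        nodes.append({"id": curie, "name": label, "category": [category]})
        edges.append({
            "id": f"{cde_id}--{curie}",
            "subject": cde_id,
            "predicate": "biolink:related_to",  # assumed predicate
            "object": curie,
        })
    return {"nodes": nodes, "edges": edges}


def merge_graphs(graphs):
    """Merge several KGX-style graphs, de-duplicating nodes by id."""
    nodes = {}
    edges = []
    for g in graphs:
        for n in g["nodes"]:
            nodes.setdefault(n["id"], n)
        edges.extend(g["edges"])
    return {"nodes": list(nodes.values()), "edges": edges}


def cdes_for_concepts(graph, concept_ids):
    """Map each known concept id to the sorted CDE ids linked to it."""
    index = defaultdict(set)
    for e in graph["edges"]:
        index[e["object"]].add(e["subject"])
    return {c: sorted(index[c]) for c in concept_ids if c in index}


# Placeholder data: identifiers and names are illustrative, not real CDEs.
graph = merge_graphs([
    cde_to_kgx(
        "HEALCDE:example-1",
        "Pain intensity (example)",
        [("EXAMPLE:pain", "pain", "biolink:PhenotypicFeature")],
    ),
    cde_to_kgx(
        "HEALCDE:example-2",
        "Opioid use (example)",
        [
            ("EXAMPLE:opioid", "opioid", "biolink:ChemicalEntity"),
            ("EXAMPLE:pain", "pain", "biolink:PhenotypicFeature"),
        ],
    ),
])

print(cdes_for_concepts(graph, ["EXAMPLE:pain"]))
# {'EXAMPLE:pain': ['HEALCDE:example-1', 'HEALCDE:example-2']}
```

In a real run, the placeholder CURIEs would be replaced by normalized ontology identifiers, so a study variable annotated with the same concept could be matched to these CDEs through the same lookup.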
Design
PM Tracking
@hhiles to work with @vgardner-renci and Kathy to report on the following HEAL epics and their related GitHub tickets, giving updates directly to PMs or entering them in Monday.com:
- Index Data Dictionaries
- Develop Automated Curation and Search
- Enable Cloud Based Semantic Search
