Skip to content

Conversation

@AlexanderGress
Copy link

Hi,

I implemented some features:

  • Search MaveDB with some filters to get a list of urns (or without filters to get all of them)
  • Download scoretables from MaveDB
  • Create a local clone of MaveDB
  • Different features that aim to create datasets used in Machine Learning applications, like data aggregation and effect value scaling (currently limited to SAVs)

If you have questions or anything: alexander.gress@helmholtz-hips.de

best wishes,
Alexander Gress

ML tools include
 - aggregation of scoresets from same expirements
 - scaling effect values
 - fetching whole protein sequences and updating the hgvs_pro ids
   accordingly
 - outputting datasets in a specialized fasta format
 - currently works just for savs
Implementating a search feature to collect urns with coresponding meta
data from the database.
Added the feature to actually download score tables
Also added features for cloning the whole database locally
Added three example scripts to show how to use the ML features
- also updated procession of MaveDB data to newest standards
- capable to process the now downloadable full MaveDB
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant