Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Future MAX and DAX assets #54

Open
5 tasks
ckadner opened this issue Oct 5, 2021 · 5 comments
Open
5 tasks

Future MAX and DAX assets #54

ckadner opened this issue Oct 5, 2021 · 5 comments
Assignees

Comments

@ckadner
Copy link
Member

ckadner commented Oct 5, 2021

List of MAX models to be migrated into MLX:

  • CodeNet code complexity analysis (currently prototype)
  • ...

List of DAX datasets to be integrated into MLX:

  • FinTabNet - 15 GB
  • Genomics (2 datasets) - 80 GB -> need to confirm if we need this
  • ...

@kmh4321

@ckadner
Copy link
Member Author

ckadner commented Oct 5, 2021

@kmh4321 -- could you provide a tentative list of models and datasets you are planning to integrate into the MLX catalog?

@kmh4321
Copy link
Contributor

kmh4321 commented Nov 8, 2021

We have all the required MAX models on MLX.

For DAX we propose the following datasets be added to the Katalog:

Must haves (either research commitments or DAX is the only home for these datasets currently):

FinTabNet
COVID Question Answers
Oil Reservoir Simulations
Split and Rephrase
Expert in the Loop AI
Taranaki Basin Curated Well Logs
D2A
Genomics BLAST Indices (can revisit with author if they can host it elsewhere as well)
PRROMenade (can revisit with author if they can host it elsewhere as well)

Nice to haves (these datasets have good numbers/metrics but can also be found outside of DAX):

Wildfire dataset ( was part of Call for Code earlier)
Fashion MNIST
Airline on time performance dataset

@ckadner
Copy link
Member Author

ckadner commented May 23, 2022

@kmh4321 -- Regarding missing MAX models, I think we are missing the Audio Classifier still. Did you have a PR for that prepared already? If not I can create one for it?

@Tomcli @animeshsingh -- Do you recall if we deliberately chose not to include the Audio Classifier in the MLX Katalog?

FYI @michaelhind

@Tomcli
Copy link
Member

Tomcli commented May 23, 2022

@ckadner Audio Classifier is not in the initial 10 MAX examples that we want to maintain, and it's no long part of the maintained models?

@kmh4321
Copy link
Contributor

kmh4321 commented May 24, 2022

Hi @ckadner,

I don't have that ready yet. I will be happy to put in the PR over the long weekend or else I'd be happy to review if you plan to create it.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants