fma
clusterdata
fma | clusterdata | |
---|---|---|
1 | 1 | |
2,108 | 1,495 | |
- | 2.5% | |
0.0 | 4.5 | |
over 1 year ago | 9 months ago | |
Jupyter Notebook | Jupyter Notebook | |
MIT License | - |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
fma
-
Analyzing music to determine subgenre?
This dataset seems worth looking into: https://github.com/mdeff/fma. I think you'll have a hard time identifying subgenres since even people don't know what subgenre a song belongs to. It's a very subjective classification compared to distinguishing between main genres; e.g. rock, rap, and country. Also, from my work with the Spotify API, there a lot of seemingly synonymous subgenres which will make this task even more tedious (what is the difference between "pop dance" and "dance pop"?).
clusterdata
What are some alternatives?
mac-miller-lyrics-dataset - Dataset with lyrics from Mac Miller
waymo-open-dataset - Waymo Open Dataset
SKAB - SKAB - Skoltech Anomaly Benchmark. Time-series data for evaluating Anomaly Detection algorithms.
raccoon_dataset - The dataset is used to train my own raccoon detector and I blogged about it on Medium
toiletmap - API/UI server for the Great British Public Toilet Map
whylogs - An open-source data logging library for machine learning models and data pipelines. 📚 Provides visibility into data quality & model performance over time. 🛡️ Supports privacy-preserving data collection, ensuring safety & robustness. 📈
essentia - C++ library for audio and music analysis, description and synthesis, including Python bindings
datasets - 🎁 5,400,000+ Unsplash images made available for research and machine learning
covid19za - Coronavirus COVID-19 (2019-nCoV) Data Repository and Dashboard for South Africa
covid-chestxray-dataset - We are building an open database of COVID-19 cases with chest X-ray or CT images.
TheVault - [EMNLP 2023] The Vault: A Comprehensive Multilingual Dataset for Advancing Code Understanding and Generation