Difference between revisions of "HipHop Genealogy"
From CCRMA Wiki
(→Data Files) |
|||
Line 9: | Line 9: | ||
List of each possible genre in the dataset. Handwritten and used by <tt>categorize.py</tt> for manual genre entry. | List of each possible genre in the dataset. Handwritten and used by <tt>categorize.py</tt> for manual genre entry. | ||
* meta.json | * meta.json | ||
+ | The main catalogue of metadata associated with each WAV file. Currently includes genre and artist(s) info, in addition to file paths of compressed/WAV versions of the audio data. | ||
* wav | * wav | ||
+ | Directory of uncompressed audio data files. Automatically populated by <tt>decompress.py</tt> | ||
== Utility Tools == | == Utility Tools == |
Revision as of 00:58, 10 November 2013
In /usr/ccrma/media/databases/hiphop-gene/ are the following files:
Data Files
- artists.json
A list of each artist in the dataset. Rather than extracted from tags in the mp3 file, they are hand-entered via categorize.py to ensure correct normalization.
- compressed
Loosely organized directory of mp3/m4a/etc. files for the base data set.
- genres.json
List of each possible genre in the dataset. Handwritten and used by categorize.py for manual genre entry.
- meta.json
The main catalogue of metadata associated with each WAV file. Currently includes genre and artist(s) info, in addition to file paths of compressed/WAV versions of the audio data.
- wav
Directory of uncompressed audio data files. Automatically populated by decompress.py
Utility Tools
- decompress.py
- build_artists.py
- categorize.py
- export_mat.py