Jump to content

HipHop Genealogy: Difference between revisions

From CCRMA Wiki
Spencer (talk | contribs)
No edit summary
Spencer (talk | contribs)
Line 9: Line 9:
List of each possible genre in the dataset. Handwritten and used by <tt>categorize.py</tt> for manual genre entry.  
List of each possible genre in the dataset. Handwritten and used by <tt>categorize.py</tt> for manual genre entry.  
* meta.json
* meta.json
The main catalogue of metadata associated with each WAV file. Currently includes genre and artist(s) info, in addition to file paths of compressed/WAV versions of the audio data.
* wav
* wav
Directory of uncompressed audio data files. Automatically populated by <tt>decompress.py</tt>


== Utility Tools ==  
== Utility Tools ==  

Revision as of 08:58, 10 November 2013

In /usr/ccrma/media/databases/hiphop-gene/ are the following files:

Data Files

  • artists.json

A list of each artist in the dataset. Rather than extracted from tags in the mp3 file, they are hand-entered via categorize.py to ensure correct normalization.

  • compressed

Loosely organized directory of mp3/m4a/etc. files for the base data set.

  • genres.json

List of each possible genre in the dataset. Handwritten and used by categorize.py for manual genre entry.

  • meta.json

The main catalogue of metadata associated with each WAV file. Currently includes genre and artist(s) info, in addition to file paths of compressed/WAV versions of the audio data.

  • wav

Directory of uncompressed audio data files. Automatically populated by decompress.py

Utility Tools

  • decompress.py
  • build_artists.py
  • categorize.py
  • export_mat.py