Million Song Dataset

Predicting the release year of a song

Medium

|

2 Submissions

Context

Songs, like any other audio signal, feature distinctive fundamental frequencies, timbre components, and other properties. Each song is unique in these respects, which is why they can be patterned.


Objective

Your task is to use machine learning models to predict the release year (between 1922 and 2011) of a song that is described by 90 attributes of average timbre and covariance.


Evaluation Criteria

Once you generate and submit the target variable predictions on evaluation dataset, your submissions will be compared with the true values of the target variable. 

The True or Actual values of the target variable are hidden on the DPhi Practice platform so that we can evaluate your model's performance on unseen data. Finally, a Root-Mean-Squared-Error (RMSE) score for your model will be generated and displayed.

About the dataset

This database contains 90 attributes of average timbre and covariance. The target variable refers to the year of release per song, between 1922 and 2011.

Download the training set from the following link: https://drive.google.com/file/d/1EjnfKFByNtRbcumGF-cDVLHVM5VPnb4h/view, unzip the file and load the training data in your jupyter notebook, use the below command:

import pandas as pd
songs_data  = pd.read_csv("Training_set_songs.csv" )

Data Description
  • TA01 to TA12 – Timbre avarages
  • TC01 to TC78 – Timbre covariances
  • Year – Release year

Evaluation Dataset

Download the testing set from the following link: https://drive.google.com/file/d/1EjnfKFByNtRbcumGF-cDVLHVM5VPnb4h/view, unzip the file and load the testing data in your jupyter notebook, use the below command:

songs_data = pd.read_csv("Testing_set_songs.csv" )

Here the target column is deliberately not there as you need to predict it.


References

This dataset is adapted from:

T. Bertin-Mahieux. UCI Machine Learning Repository. Irvine, CA: University of California, School of Information and Computer Science. 2019. Available at: http://archive.ics.uci.edu/ml/datasets/YearPredictionMSD.

loading...

You need to choose a submission file.

File Format

Your submission should be in CSV format.

Predictions

This file should have a header row called 'prediction'.
Please see the instructions to save a prediction file under the “Data” tab.

To participate in this challenge either you have to create a team of atleast members or join some team