Example workflow

Music classification from CSV audio features

Q: Can MLdeck classify music tracks from CSV features?

Yes, MLdeck can run music classification from tabular audio features as a CSV classification workflow.

Q: Why does loudness sometimes dominate music classification?

Loudness may separate classes, correlate with genre or mastering, or signal dataset bias that needs review.

Q: Should I drop track ID or artist columns?

Track identifiers are usually dropped. Artist columns may need careful review because they can leak genre or dataset grouping.

Q: Can duplicate songs inflate accuracy?

Yes. Duplicate tracks, albums, or artists across splits can make metrics look stronger than they are.

Q: Can I export a music classifier from MLdeck?

Yes. Exported artifacts include schema and manifest metadata for validation and runtime testing.

This example walks through training a music-genre classifier from a CSV of audio features — entirely in your browser. It covers Data Quality review, target selection, leakage checks, baseline comparison, validation evidence, and export artifact review.

Train with your own CSV after sign-in Review data-quality checks

What music classification from CSV means

Music classification from CSV is a tabular classification workflow. Instead of training on raw audio files, the dataset contains extracted features. Each row may describe one track, a segment, or an aggregated song profile. The target may be genre, mood_label, or hit_or_not. The model learns patterns from numeric and categorical columns rather than listening to audio directly.

This makes the workflow approachable for education and practical feature review. It also means the quality of the feature extraction process matters. A CSV can be useful for assessing whether simple features separate classes, but it does not replace audio-domain validation or human review.

Example music dataset structure

A typical music CSV might include track_id, artist, genre, tempo, loudness, energy, danceability, acousticness, duration, key, and mode. The target could be the genre label, a mood category, or a binary hit label. If the target is genre, it should not also appear encoded in another field.

Identifiers like track_id usually do not belong in training. Artist and album columns require judgment. They may be meaningful context, but they can also create leakage-like behavior if the same artist appears repeatedly and the task is essentially remembering artist-to-genre mappings.

Choosing the classification target

Choose a target that represents the class you want to predict. For genre classification, the target may be genre. For mood classification, it may be mood_label. For commercial exploration, it may be hit_or_not, though that label often depends on time, market, playlisting, and promotion. If the target has many rare categories, the model may struggle or overfit.

MLdeck can help inspect target distribution and class balance. If one genre dominates the dataset, majority-class baseline performance may already be high. The trained model should beat that baseline meaningfully before the result is interesting.

Why some features can dominate the model

A feature such as loudness may appear very important for several reasons. It may genuinely separate classes. It may correlate with genre because certain genres are mastered differently. It may reflect production era, label, or dataset source. It may also indicate a bias if the dataset was assembled from playlists or platforms with different normalization rules.

Dominant features are not automatically wrong. They are a prompt for review. If loudness predicts genre too well, inspect whether duplicates, source artifacts, or preprocessing differences are shaping the result. In music workflows, high apparent accuracy can come from dataset construction rather than generalizable acoustic structure.

Feature checks for loudness, tempo, energy, and genre labels

Check numeric ranges for tempo, loudness, energy, acousticness, and danceability. Missing values may indicate failed feature extraction. Extreme values may reflect parsing issues. Categorical fields such as key and mode should have expected ranges. Genre labels should be consistent; "hip hop", "Hip-Hop", and "rap" may represent different labels unless cleaned intentionally.

When using MLdeck, review type inference, missingness, and warnings before training. A useful model comparison depends on a sensible feature table. If the labels are noisy or inconsistent, the leaderboard may reflect label cleanup issues instead of musical structure.

Preprocessing and model comparison in MLdeck

MLdeck can preprocess numeric and categorical columns, compare candidate classifiers, and show leaderboard evidence. Numeric features may be scaled or imputed. Categorical columns may be encoded. The workflow is browser-local during normal training flows, so raw CSV training data is not uploaded to a cloud training service for this exploration.

Compare simple and more flexible models. If every model performs similarly, the feature set may have limited signal. If one model performs far better, inspect whether it is using a suspicious feature. Treat model comparison as a diagnostic step, not a final endorsement.

Interpreting high accuracy carefully

High accuracy can be real, but it can also be a warning sign. Duplicate tracks, album duplicates, artist repetition, or target leakage can inflate metrics. If songs from the same artist or album appear in both training and evaluation rows, the model may learn the artist signature rather than generalize to new music.

For stronger validation evidence, group-aware splits by artist, album, or track family may be useful. MLdeck's public example shows how browser-local review can surface those questions for music recommendation, tagging, and catalog workflows.

Data leakage and duplicate-track risks

Drop track_id and other row identifiers. Review artist and album fields carefully. Remove columns created after the label, such as playlist placement after a song became popular, or editorial tags that already encode the target. Duplicate rows should be investigated because they can make evaluation evidence optimistic.

Exporting classification artifacts

After exploration, MLdeck can export artifacts such as ONNX, Docker packages, Python files, and PDF reports with schema and manifest metadata for validation and runtime testing. Music classifier reviews often include representative tracks, missing feature cases, rare genres, and new artists. The export is designed for portable ONNX Runtime inference with parity-review context.

Music classification FAQ

Can MLdeck classify music tracks from CSV features?

Yes. It can explore music classification from tabular audio features in a browser-based CSV workflow.

Why does loudness sometimes dominate music classification?

It may genuinely separate labels, correlate with genre, reflect mastering patterns, or reveal dataset bias.

Should I drop track ID or artist columns?

Track IDs are usually dropped. Artist columns need review because they can inflate results if artists repeat across splits.

Can duplicate songs inflate accuracy?

Yes. Duplicate tracks, albums, or artists can make evaluation look stronger than future performance.

Can I export a music classifier from MLdeck?

Yes. Exports include schema and manifest metadata for validation, runtime testing, and parity review.

Related examples and guides

Explore similar browser-local CSV classification and export workflows.

Customer churn prediction from CSV Data quality checks for ML Data Quality for Machine Learning AutoML validation evidence AutoML export artifacts ONNX export workflow Browser-local AutoML Train ML models in your browser Local AutoML for CSV files