Go to Data Bounty Hub

Spotify Music Crawling Requirements
01KH607211JMV2K0BRKK2J1Z51
Expected Completion Date: 2026-02-27
Publish Date: 2026-02-11
Crawl public music data from the Spotify platform, supporting multi-dimensional filtering for songs, playlists, and albums. Export structured data including track information, artist details, stream counts, platform-exclusive identifiers, and other fields in JSON or CSV format.
I. Search Criteria 📌
Supports multi-condition combined filtering to precisely define the crawling scope and improve data collection efficiency. Specific conditions are listed below:
| Filter Type | Description |
|---|---|
| Keyword Filter | Supports fuzzy matching across song titles, artist/band names, album names, and lyric snippets for fast target content retrieval. |
| Genre Filter | Classifies by music attributes (Pop, Rock, Classical, Electronic, Hip-Hop/Rap, etc.), with single or multiple selection available, aligned with Spotify’s categorization system. |
| Region Filter | Filters by region as defined on the Spotify platform, suitable for crawling popular songs, artists, and exclusive playlists in different regions. |
| Year Filter | Supports precise filtering by single year or range filtering (e.g., 2020–2024), covering both classic and newly released music. |
| Popularity Filter | Sorts and filters by real‑time platform popularity (streams, saves), prioritizing high‑popularity songs and trending playlists. |
| Playlist/Album Filter | Supports precise crawling by playlist ID or album ID, or filtering by playlist type (official curated, user‑created). |
II. Export Fields đź“‹
After crawling, the following complete structured fields are exported without redundancy or missing data, meeting development and practical usage requirements. Field categories are listed below:
| Field Category | Specific Fields | Notes |
|---|---|---|
| Basic Core Fields | Song Title, Artist/Band Name, Album Name | Artist/band labeled with region; album labeled with release date, distinguishing EP, single, and full‑length album. |
| Basic Core Fields | Release Year, Genre, Duration | Duration in seconds; multiple genre tags supported (e.g., “Pop + Electronic”), consistent with Spotify classification. |
| Basic Core Fields | Stream Count, Save Count, Rating | All official Spotify data; rating combines user ratings and platform recommendation index. |
| Basic Core Fields | Region, Language | Region refers to song release or artist origin; language indicates vocal language (multiple languages supported). |
| Album/Playlist Exclusive Fields | Album Cover, Playlist Cover, Track Count | Album/playlist covers provided as high‑definition URLs; track count shows total songs in the album or playlist. |
| Song Exclusive Fields | Lyrics (Basic/Full), Style Tags | Priority given to official Spotify‑associated lyrics; style tags match platform labels (e.g., “Emotional Pop”, “Hardcore Rock”). |
| Spotify Exclusive Fields | Artist Bio, Song ID, Album ID | Artist bio as concise official introduction; IDs are unique platform identifiers for subsequent data association. |
III. Output Formats 📊
Two universal structured formats are available for export based on scenario needs:
| Output Format | Application Scenarios | Key Advantages |
|---|---|---|
| JSON | Program invocation, batch system import, secondary development | Clear structure, optimized for code parsing and automated processing; supports nested fields for artists, albums, and songs. |
| CSV | Tabular viewing, data filtering, Excel editing, data statistics | Simple and intuitive, easy to operate; numerical fields such as stream count and duration use standardized formats for manual analysis. |
Additional Notes (Aligned with Spotify Features)
- Adapted to Spotify‑exclusive content (official curated playlists, artist top tracks, regional charts), ensuring accurate crawling of unique identifiers including Song ID and Album ID.
- Prioritizes crawling publicly accessible platform data; handles copyright restrictions and regional content blocks to ensure compliant and complete collection of lyrics, stream counts, and other fields.
- Differentiates singles, EPs, and full‑length albums with accurate track count labeling, consistent with Spotify’s music classification logic.