Home Millions of Songs Found in AI Training Datasets, Investigation Reveals

Millions of Songs Found in AI Training Datasets, Investigation Reveals

A new investigation reveals that massive datasets containing millions of songs are being used to train AI music models, intensifying copyright disputes.

Tech & Innovation

June 16, 2026

byDaniel Obembe

A digital interface displaying a searchable database of millions of song titles and artist names used in AI training datasets.

An investigation has uncovered four large-scale music datasets being shared within the AI-development community, with the largest containing 12 million tracks and another holding 9 million songs. The findings shed new light on the scope of copyrighted material potentially used to train generative AI music models without licensing agreements.

Alex Reisner, who examined the datasets, highlighted the gap between industry claims and the reality of accessible data.

“Companies often claim to use only content that is freely available online, but the datasets reveal the quantity of downloadable music that developers can access even though it is not supposed to be free,” Reisner wrote.

The revelation comes as Universal Music Group and Sony Music recently sought to add more than 61,000 recordings to their copyright infringement lawsuit against AI music service Suno, a move Suno is opposing. The datasets have been downloaded thousands of times, but it remains unclear which companies have used them for training. The investigation has made the collections searchable, allowing the public to see which songs are included.

Tech & Innovation

June 16, 2026

byDaniel Obembe

Martin Garrix performing on stage with pink, blue, and purple lasers, debuting the Madonna collaboration 'Bizarre' at Barclays Center.

Martin Garrix Debuts Madonna Collaboration 'Bizarre' in New York

Culture & Lifestyle

June 16, 2026

OnlyFans Manager Exploitation Raises Red Flags for Musicians

Tech & Innovation

June 16, 2026

Recommended for You

OnlyFans Manager Exploitation Raises Red Flags for Musicians

Tech & Innovation

byDave Ayodeji

The Salt Shed music venue in Chicago, a former Morton Salt factory redeveloped by 16” on Center.

Chicago Venue Collective 16OC Thrives on Community and Diversification

Tech & Innovation

byPatrick Ofe

Molly Neuman, president of CD Baby, speaking at The Great Escape music festival and conference in Brighton, England.

CD Baby’s Molly Neuman Shares Advice for Independent Musicians

Tech & Innovation

byDaniel Obembe

Spotify Beach at Cannes Lions 2026: Full Lineup of Talks and Performances

Tech & Innovation

byPatrick Ofe

Infographic from Spotify's 2026 Francophone Report highlighting royalty figures and global streaming data for French music.

Spotify 2026 Francophone Report: €319m Royalties, Half From Outside France

Tech & Innovation

byDave Ayodeji

A world map highlighting the global reach of French-language music, podcasts, and audiobooks on Spotify in 2025.

148 Million Global Listeners Tune Into French-Language Content on Spotify in 2025

Tech & Innovation

byGrace Wangeci

A collage of concert crowds and city skylines representing Arab diaspora music scenes in London, Paris, New York, and Los Angeles.

Arabic Music’s Global Growth Is Being Driven by Diaspora Communities, Not Regional Labels

Tech & Innovation

byPatrick Ofe

Spotify for Artists interface showing the video upload option for musicians.

Spotify Launches Direct Video Upload Beta for Artists

Tech & Innovation

byGrace Wangeci

Dan Fowler Independent Music Report Released in Six Languages

Universal Music Group Sells Curve Royalty Systems to Merlin and Jamen Capital

Cantilever Closes £250,000 Pre-Seed Round with Independent Label Backing

Latest Posts

InterSpace Distribution Signs Partnership Deal With ACRCloud

Warner Music Group and Paramount Strike Deal to Produce Biopics on WMG Artists

Believe Announces AZTEC, Its First U.S. Label

Most Discussed

Davido Signs Boi Chase to DMW in Grand Dinner Ceremony

Nigeria proudly on the global stage: 2026 Grammy nominations for Afrobeats

Google’s Lyria 3 Signals a Major Leap in AI Music Innovation

Dan Fowler Independent Music Report Released in Six Languages

Universal Music Group Sells Curve Royalty Systems to Merlin and Jamen Capital

Cantilever Closes £250,000 Pre-Seed Round with Independent Label Backing

Dan Fowler Independent Music Report Released in Six Languages

Universal Music Group Sells Curve Royalty Systems to Merlin and Jamen Capital

Millions of Songs Found in AI Training Datasets, Investigation Reveals

Martin Garrix Debuts Madonna Collaboration 'Bizarre' in New York

OnlyFans Manager Exploitation Raises Red Flags for Musicians

@InterSpace.Africa

Recommended for You

OnlyFans Manager Exploitation Raises Red Flags for Musicians

Chicago Venue Collective 16OC Thrives on Community and Diversification

CD Baby’s Molly Neuman Shares Advice for Independent Musicians

Spotify Beach at Cannes Lions 2026: Full Lineup of Talks and Performances

Spotify 2026 Francophone Report: €319m Royalties, Half From Outside France

148 Million Global Listeners Tune Into French-Language Content on Spotify in 2025

Arabic Music’s Global Growth Is Being Driven by Diaspora Communities, Not Regional Labels

Spotify Launches Direct Video Upload Beta for Artists

The music tech brief, no fluff.