PyData London 2023

Language Models for Music Recommendation
06-03, 15:45–16:25 (Europe/London), Minories

Music streaming services like Spotify and youtube are famous for their recommendation systems and each service takes a unique approach to recommending and personalize content. While most users are happy with the recommendations provided, there are a section of users who are curious how and why a certain track is recommended. Complex recommendation systems take various factors like track metadata, user metadata, and play counts along with the track content itself.

Inspired by Andrej Karpathy to build an own GPT, we have to use Language Models to build our own music recommendation system.


Music streaming services like spotify and youtube are famous for their recommendation systems and each service takes a unique approach to recommend and personalize content. While most users are happy with the recommendations provided, there are a section of users who are curious how and why a certain track is recommended. Complex recommendation systems take various factors like track metadata, user metadata, play counts along with the track content itself.

As music aficionados who love techno, trance, deep house and classical genres, we want to understand the following questions:

  1. Can we analyze the signals from the song track and identify the different instruments used?
  2. How can we create embeddings of all the tracks and index these for further analysis?
  3. How do we create a simple User interface to pick a song track, retrieve relevant embeddings from a section of the track and get recommendations based on just the music content.
  4. As a side effect, can I retrieve similar sections of music across various tracks.
  5. Using audio LMs, can we generate high quality music based of the embeddings?

Prior Knowledge Expected

No previous knowledge expected

Nischal is currently playing the role of Vice President of Data and ML at scoutbee, a company based out of Berlin, that is operating the space of procurement.

Having worked in the industry over the last 12+ years across enterprise companies and startups, Nischal has had the privilege of building and being part of teams that have designed and implemented data engineering and data science products to solve hard problems. Understanding the challenges of building data systems with Machine learning at the helm of it and taking them from research to production has been a fascinating and rewarding experience for Nischal.

Raghotham currently leads NLP and computer vision teams at PayPal. Previously, he has built and led ML teams from scratch for various small and large enterprises.