Web Data Extraction Examples
Task:
Create a database of 1 million world's most popular songs (title, artist and popularity fields) based on Last.fm's statistical data

We started by crawling Last.fm's website, gathering the artist names and put the data to the first database. There were found about 700,000 artists and bands. Then we've collected and put to another database all the listed songs with the popularity data ("plays in last 6 months") for each artist.
The result: We extracted all the data and compiled an excel file with 1 million most popular songs.

Copyright © 2009 DATA89.com