Python Forum

Full Version: mongodb beatles data scraping question
You're currently viewing a stripped down version of our content. View the full version with proper formatting.
As a hobbyist I wanted to scrape data and store it in mongodb and that has worked fine. My focus was to get data related to each Beatles song and so I have data fields such as song title, date recorded, lyrics etc all in one collection. But now I want to scrape data where a Beatle is quoted (during press conferences) but the data is not necessarily related to a particular song- but it might be. So I think I need two different collections. My question is: how might I go about finding connections between the two collections when perhaps I don't see any. Maybe there are some specials tools for this?