mirror of
https://github.com/zebrajr/imdbscrapper.git
synced 2025-12-06 00:20:21 +01:00
Scrapper to get movies and shows information from IMDB.
| src | ||
| storage | ||
| docker-compose.yml | ||
| Dockerfile | ||
| LICENSE | ||
| README.md | ||
| requirements.txt | ||
imdbscrapper
Scrapper to get movies information from IMDB, indexing it into movies and shows, with rating, release date, and a few more information.
Situation
Finding movies / shows to watch, based on ratings and release date. This search and notes would have to be done manually.
Task
Create a way to automatically index entries from movies (IMDB), so they can be searched and filtered afterwards via common software (Spreadsheet)
Action
- With Docker
docker build -t yourUser/yourPackage:yourVersion .
- Directly
Install the requirements described in requirements.txt (pip3 install -r requirements.txt) Create the folder structure or edit the settings in the main script
python3 scrapper.yml
Result
| File | Content |
|---|---|
| movies.csv | CSV file with all movies indexed |
| series.csv | CSV file with all shows indexed |
| info.log | Any errors occured. Change the debug level if you want to log info messages |
| counter.txt | The last indexed url. Needed to continue in case the script is interrupted |
Note
ToDo
- Add Error Handling in case Internet is not available
- Add possibility to re-index failed entries (to go though the indexer faster when a new movie/show is added)