Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

tv_imdb #123

Closed
Chris230291 opened this issue Nov 18, 2020 · 7 comments
Closed

tv_imdb #123

Chris230291 opened this issue Nov 18, 2020 · 7 comments

Comments

@Chris230291
Copy link

I have been looking at the tv_imdb filter and was wondering if it can modify an existing xml file and replace all titles with the original title and perhaps the description too?

I tried to set it up and have a play but it seems to be broken? The --prepStage all --download step fails because of bad links I think. After a quick google I appear to have found the links here https://datasets.imdbws.com/

Can anyone help?

Thanks,
Chris

@knowledgejunkie
Copy link
Contributor

Please also refer to #17

@Chris230291
Copy link
Author

Thanks for the reply.
So it has not been updated to support the new format yet?

Assuming it was working, would it do what I want?

@knowledgejunkie
Copy link
Contributor

According to the manpage and the code in lib/IMDB.pm, tv_imdb will replace the title with that found in the IMDB data if a match can be made, and will also replace the description field if the --with-plot option is specified when running tv_imdb.

tv_imdb has not been updated for a long time but if Amazon are making up-to-date information available again for non-commercial use this will likely trigger more development (and/or patches!)

@Chris230291
Copy link
Author

OK. The data at https://datasets.imdbws.com/ is up to date. That topic you linked seems to suggest that the data has not been updated since 2017, and wont be anymore, but that is incorrect.

@honir
Copy link
Contributor

honir commented Nov 19, 2020

There are two (official) data sources for IMDb data: 1) the ftp files we have been using for the past 23 years, 2) the new TSV files.

The FTP files in (1) have not been updated since Dec. 2017, and Amazon have said they will not be updated anymore.

The TSV files in (2) are updated daily but these files are very different in content to (1), with much data no longer available.

Notably they contain only the top 3 genres and top 3 actors, and do not contain any film descriptions.

Compare https://www.imdb.com/interfaces/ with ftp://ftp.fu-berlin.de/pub/misc/movies/database/frozendata/ and you will see how minimalist the 'new' files are.

@honir
Copy link
Contributor

honir commented Nov 19, 2020

The usual reason the tv_imdb building of local database fails is because of Perl running out of memory. Your best bet is to download the files manually and then run each stage one at a time, to avoid Perl memory errors.

@honir
Copy link
Contributor

honir commented Jan 7, 2021

tv_imdb has been changed (1) (2) and now uses significantly less memory. The download links used by the program are correct. So you should have no problem building your local database now.

Your initial question was answered above.

@honir honir closed this as completed Jan 7, 2021
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants