Flickr is an easily accessible source of user-generated pictures. Each picture comes with textual data, e.g. a title, a description, and comments. Furthermore, using the transformers package with a Hugging Face model, it is possible to generate a short caption for a picture.
Apart from that, districts in Vienna also have associated textual data, for example their Wikipedia entries or descriptions of points of interest (POIs) in each district.
Using spaCy and gensim's Doc2Vec, we use all of these textual clues to build an embedding space. Each picture is then assigned the district whose embedding is most similar to the embedding of the picture's texts. We verify the experiment using the geographic coordinates associated with the pictures.
A more complex approach, which first uses a random subset of the geotagged pictures as training data, is explored at the end.
The Viennese district boundaries and the POI information are from data.gv.at.
Pictures are from Flickr, queried via API.
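A hedged sketch of how such a query might be constructed against Flickr's REST API (`flickr.photos.search` and the parameters shown are part of the public API; the API key and the Vienna bounding box below are placeholders):

```python
from urllib.parse import urlencode

FLICKR_REST = "https://api.flickr.com/services/rest/"

def build_search_url(api_key, bbox, per_page=250):
    """Build a flickr.photos.search request URL restricted to a
    geographic bounding box (min_lon, min_lat, max_lon, max_lat)."""
    params = {
        "method": "flickr.photos.search",
        "api_key": api_key,
        "bbox": ",".join(str(c) for c in bbox),
        "has_geo": 1,                      # only geotagged pictures
        "extras": "geo,description,tags",  # include coordinates and texts
        "format": "json",
        "nojsoncallback": 1,
        "per_page": per_page,
    }
    return FLICKR_REST + "?" + urlencode(params)

# Rough bounding box around Vienna (placeholder coordinates).
url = build_search_url("YOUR_API_KEY", (16.18, 48.12, 16.58, 48.32))
print(url)
```

The JSON response can then be paged through and each photo's title, description, and coordinates collected for the experiment.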
The model used to generate image captions is from Hugging Face:
- The concrete model is Salesforce/blip-image-captioning-large