[SPARKNLP-1068] Introducing BLIPForQuestionAnswering transformer #14422

danilojsl · 2024-10-02T23:02:44Z

Description

This pull request introduces the BLIPForQuestionAnswering transformer, enabling enhanced image-based question-answering capabilities.

Usage Instructions
To utilize this new transformer, a DataFrame with the following structure is required:

image column: Contains the file paths for each image within the directory.
text column: Includes the specific question you would like to ask about each corresponding image.

Enhance Spark NLP with visual transformer capabilities.

Bug fix (non-breaking change which fixes an issue)
Code improvements with no or little impact
New feature (non-breaking change which adds functionality)
Breaking change (fix or feature that would cause existing functionality to change)

…swering

[SPARKNLP-1068] Introducing BLIPForQuestionAnswering transformer

7adf658

danilojsl requested review from maziyarpanahi and prabod October 2, 2024 23:03

danilojsl added Feature request enhancement DON'T MERGE Do not merge this PR labels Oct 2, 2024

danilojsl added 5 commits October 2, 2024 18:09

[SPARKNLP-1068] Adding BLIPForQuestionAnswering import notebook example

af0c319

[SPARKNLP-1068] Fix fullAnnotateImage validation

c256e16

[SPARKNLP-1068] Solves BLIPForQuestionAnsweringTest issue

7c46662

[SPARKNLP-1068] Updates default BLIPForQuestionAnswering model name

1b4b29d

[SPARKNLP-1068] [skip test] Adding documentation to BLIPForQuestionAn…

e121763

…swering

danilojsl self-assigned this Nov 8, 2024