Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add support for Audio and Video summarization to Docsum #865

Open
wants to merge 33 commits into
base: main
Choose a base branch
from

Conversation

MSCetin37
Copy link

@MSCetin37 MSCetin37 commented Nov 7, 2024

Description

Extend the current Document Summarization Application by incorporating video and audio summary features. This enhancement will enable the application to summarize video and audio content in addition to text documents, thereby broadening its utility and applicability.

Issues

https://github.com/opea-project/docs/blob/main/community/rfcs/24-06-21-OPEA-001-DocSum_Video_Audio.md

Type of change

List the type of change like below. Please delete options that are not relevant.

  • Bug fix (non-breaking change which fixes an issue)
  • New feature (non-breaking change which adds new functionality)
  • Breaking change (fix or feature that would break existing design and interface)
  • Others (enhancement, documentation, validation, etc.)

Tests

Whisper Service

Run the following command to validate the Whisper Service:

python comps/asr/whisper/dependency/check_whisper_server.py 

Expected output:

{'asr_result': 'who is pat gelsinger'}

Audio2Text Service

Run the following command to validate the Audio2Text Service:

python comps/dataprep/multimedia2text/audio2text/check_a2t_server.py

Expected output:

{'downstream_black_list': [], 'id': '21b0459477abea6d85d20f4b5ddcb714', 'query': 'who is pat gelsinger'}

Note: The id value will be different.

Video2Audio Service

Run the following command to validate the Video2Audio Service:

python comps/dataprep/multimedia2text/video2audio/check_v2a_microserver.py

Expected output:

========= Audio file saved as ======
comps/dataprep/multimedia2text/video2audio/converted_audio.wav
====================================

Multimedia2Text Service

Run the following command to validate the Multimedia2Text Service:

python comps/dataprep/multimedia2text/check_multimedia2text.py 

Expected output:

Running test: Whisper service
>>> Whisper service Test Passed ... 

Running test: Audio2Text service
>>> Audio2Text service Test Passed ... 

Running test: Video2Text service
>>> Video2Text service Test Passed ... 

Running test: Multimedia2text service
>>> Multimedia2text service test for text data type passed ... 
>>> Multimedia2text service test for audio data type passed ... 
>>> Multimedia2text service test for video data type passed ... 

Signed-off-by: Mustafa <[email protected]>
Signed-off-by: Mustafa <[email protected]>
Signed-off-by: Mustafa <[email protected]>
Signed-off-by: Mustafa <[email protected]>
Signed-off-by: Mustafa <[email protected]>
Signed-off-by: Mustafa <[email protected]>
Signed-off-by: Mustafa <[email protected]>
Signed-off-by: Mustafa <[email protected]>
Signed-off-by: Mustafa <[email protected]>
Signed-off-by: Mustafa <[email protected]>
Signed-off-by: Mustafa <[email protected]>
Signed-off-by: Mustafa <[email protected]>
Signed-off-by: Mustafa <[email protected]>
Signed-off-by: Mustafa <[email protected]>
Signed-off-by: Mustafa <[email protected]>
Signed-off-by: Mustafa <[email protected]>
Signed-off-by: Mustafa <[email protected]>
Signed-off-by: Mustafa <[email protected]>
Signed-off-by: Mustafa <[email protected]>
Signed-off-by: Mustafa <[email protected]>
Signed-off-by: Mustafa <[email protected]>
Signed-off-by: Mustafa <[email protected]>
@ashahba ashahba self-assigned this Nov 8, 2024
@ashahba ashahba self-requested a review November 8, 2024 00:00
Copy link

codecov bot commented Nov 8, 2024

Codecov Report

Attention: Patch coverage is 86.66667% with 2 lines in your changes missing coverage. Please review.

Files with missing lines Patch % Lines
comps/cores/mega/gateway.py 33.33% 2 Missing ⚠️
Files with missing lines Coverage Δ
comps/cores/proto/api_protocol.py 96.06% <100.00%> (+0.05%) ⬆️
comps/cores/proto/docarray.py 99.42% <100.00%> (+0.02%) ⬆️
comps/cores/mega/gateway.py 31.12% <33.33%> (+0.87%) ⬆️

... and 1 file with indirect coverage changes

@ashahba ashahba added this to the v1.1 milestone Nov 8, 2024
@ashahba ashahba added the WIP label Nov 8, 2024
Copy link
Collaborator

@ashahba ashahba left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

comps/asr/whisper/dependency/whisper_model.py Show resolved Hide resolved
comps/dataprep/multimedia2text/Dockerfile Show resolved Hide resolved
comps/dataprep/multimedia2text/Dockerfile Show resolved Hide resolved
comps/cores/mega/gateway.py Outdated Show resolved Hide resolved
comps/cores/mega/gateway.py Outdated Show resolved Hide resolved
comps/dataprep/multimedia2text/README.md Outdated Show resolved Hide resolved
comps/dataprep/multimedia2text/audio2text/audio2text.py Outdated Show resolved Hide resolved
@ashahba ashahba changed the title Docsum Add support for Audio and Video summarization to Docsum Nov 8, 2024
.github/workflows/docker/compose/dataprep-compose-cd.yaml Outdated Show resolved Hide resolved
comps/dataprep/multimedia2text/README.md Outdated Show resolved Hide resolved
comps/dataprep/multimedia2text/audio2text/Dockerfile Outdated Show resolved Hide resolved
comps/dataprep/multimedia2text/audio2text/audio2text.py Outdated Show resolved Hide resolved
comps/dataprep/multimedia2text/check_multimedia2text.py Outdated Show resolved Hide resolved
comps/dataprep/multimedia2text/multimedia2text.py Outdated Show resolved Hide resolved
comps/dataprep/multimedia2text/video2audio/video2audio.py Outdated Show resolved Hide resolved
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants