Add support for Audio and Video summarization to Docsum #865

MSCetin37 · 2024-11-07T22:55:29Z

Description

Extend the current Document Summarization Application by incorporating video and audio summary features. This enhancement will enable the application to summarize video and audio content in addition to text documents, thereby broadening its utility and applicability.

Issues

https://github.com/opea-project/docs/blob/main/community/rfcs/24-06-21-OPEA-001-DocSum_Video_Audio.md

Type of change

List the type of change like below. Please delete options that are not relevant.

Bug fix (non-breaking change which fixes an issue)
New feature (non-breaking change which adds new functionality)
Breaking change (fix or feature that would break existing design and interface)
Others (enhancement, documentation, validation, etc.)

Tests

Whisper Service

Run the following command to validate the Whisper Service:

python comps/asr/whisper/dependency/check_whisper_server.py

Expected output:

{'asr_result': 'who is pat gelsinger'}

Audio2Text Service

Run the following command to validate the Audio2Text Service:

python comps/dataprep/multimedia2text/audio2text/check_a2t_server.py

Expected output:

{'downstream_black_list': [], 'id': '21b0459477abea6d85d20f4b5ddcb714', 'query': 'who is pat gelsinger'}

Note: The id value will be different.

Video2Audio Service

Run the following command to validate the Video2Audio Service:

python comps/dataprep/multimedia2text/video2audio/check_v2a_microserver.py

Expected output:

========= Audio file saved as ======
comps/dataprep/multimedia2text/video2audio/converted_audio.wav
====================================

Multimedia2Text Service

Run the following command to validate the Multimedia2Text Service:

python comps/dataprep/multimedia2text/check_multimedia2text.py

Expected output:

Running test: Whisper service
>>> Whisper service Test Passed ... 

Running test: Audio2Text service
>>> Audio2Text service Test Passed ... 

Running test: Video2Text service
>>> Video2Text service Test Passed ... 

Running test: Multimedia2text service
>>> Multimedia2text service test for text data type passed ... 
>>> Multimedia2text service test for audio data type passed ... 
>>> Multimedia2text service test for video data type passed ...

Signed-off-by: Mustafa <[email protected]>

codecov · 2024-11-08T00:03:30Z

Codecov Report

Attention: Patch coverage is 86.66667% with 2 lines in your changes missing coverage. Please review.

Files with missing lines	Patch %	Lines
comps/cores/mega/gateway.py	33.33%	2 Missing ⚠️

Files with missing lines	Coverage Δ
comps/cores/proto/api_protocol.py	`96.06% <100.00%> (+0.05%)`	⬆️
comps/cores/proto/docarray.py	`99.42% <100.00%> (+0.02%)`	⬆️
comps/cores/mega/gateway.py	`31.12% <33.33%> (+0.87%)`	⬆️

... and 1 file with indirect coverage changes

Signed-off-by: Mustafa <[email protected]>

ashahba

Thanks @MSCetin37 and @HarshaRamayanam

comps/asr/whisper/dependency/whisper_model.py

comps/dataprep/multimedia2text/Dockerfile

comps/cores/mega/gateway.py

comps/dataprep/multimedia2text/README.md

comps/dataprep/multimedia2text/audio2text/audio2text.py

comps/dataprep/multimedia2text/audio2text/check_a2t_server.py

comps/dataprep/multimedia2text/video2audio/video2audio.py

comps/dataprep/multimedia2text/video2audio/video2audio_microservice.py

Signed-off-by: Mustafa <[email protected]>

.github/workflows/docker/compose/dataprep-compose-cd.yaml

comps/dataprep/multimedia2text/README.md

comps/dataprep/multimedia2text/audio2text/Dockerfile

comps/dataprep/multimedia2text/audio2text/audio2text.py

comps/dataprep/multimedia2text/audio2text/check_a2t_server.py

comps/dataprep/multimedia2text/check_multimedia2text.py

comps/dataprep/multimedia2text/multimedia2text.py

comps/dataprep/multimedia2text/video2audio/video2audio.py

comps/dataprep/multimedia2text/video2audio/video2audio_microservice.py

Signed-off-by: Mustafa <[email protected]>

for more information, see https://pre-commit.ci

MSCetin37 added 25 commits October 9, 2024 12:11

v2a services

6c14008

Signed-off-by: Mustafa <[email protected]>

add a2t - llm

1b28def

Signed-off-by: Mustafa <[email protected]>

update whisper serve

8295cd6

Signed-off-by: Mustafa <[email protected]>

updates

6a44c5e

Signed-off-by: Mustafa <[email protected]>

add data service

80ac6a5

Signed-off-by: Mustafa <[email protected]>

gateway

89e76c7

Signed-off-by: Mustafa <[email protected]>

clean gateway & orchestrator

0ff5083

Signed-off-by: Mustafa <[email protected]>

updates

22a6516

Signed-off-by: Mustafa <[email protected]>

updates

89723ed

Signed-off-by: Mustafa <[email protected]>

adding functional tests

4907fc1

Signed-off-by: Mustafa <[email protected]>

updates

6cff4b2

Signed-off-by: Mustafa <[email protected]>

updates

afbbbde

Signed-off-by: Mustafa <[email protected]>

updates read me file

b765da5

Signed-off-by: Mustafa <[email protected]>

name changes

f84cdcb

Signed-off-by: Mustafa <[email protected]>

update readme file

f4f7d55

Signed-off-by: Mustafa <[email protected]>

update readme file

a4cb22d

Signed-off-by: Mustafa <[email protected]>

update readme file

9a9346e

Signed-off-by: Mustafa <[email protected]>

update readme file

01e0a4c

Signed-off-by: Mustafa <[email protected]>

update readme file

cce4a61

Signed-off-by: Mustafa <[email protected]>

update max token option

37112c0

Signed-off-by: Mustafa <[email protected]>

update the test files

d428d10

Signed-off-by: Mustafa <[email protected]>

readme updtes

f2440ed

Signed-off-by: Mustafa <[email protected]>

readme updtes

5b2b6cf

Signed-off-by: Mustafa <[email protected]>

merge sync

d6ee02f

Signed-off-by: Mustafa <[email protected]>

clean code

a2a8c86

Signed-off-by: Mustafa <[email protected]>

MSCetin37 requested a review from lvliang-intel as a code owner November 7, 2024 22:55

ashahba self-assigned this Nov 8, 2024

ashahba self-requested a review November 8, 2024 00:00

ashahba added this to the v1.1 milestone Nov 8, 2024

ashahba added the WIP label Nov 8, 2024

MSCetin37 added 2 commits November 7, 2024 22:22

update dataprep-compose-cd.yaml file

f402ba3

Signed-off-by: Mustafa <[email protected]>

merge and sync

6e10b4f

Signed-off-by: Mustafa <[email protected]>

MSCetin37 force-pushed the docsum branch from 2bfb4a3 to 6e10b4f Compare November 8, 2024 06:26

MSCetin37 added 2 commits November 7, 2024 22:30

merge and sync

ab61f95

Signed-off-by: Mustafa <[email protected]>

merge and sync gateway

ff3ef0e

Signed-off-by: Mustafa <[email protected]>

ashahba requested changes Nov 8, 2024

View reviewed changes

adding the copyright header

e8cd092

Signed-off-by: Mustafa <[email protected]>

ashahba changed the title ~~Docsum~~ Add support for Audio and Video summarization to Docsum Nov 8, 2024

ashahba requested changes Nov 8, 2024

View reviewed changes

update the end of file char

bba8404

Signed-off-by: Mustafa <[email protected]>

MSCetin37 force-pushed the docsum branch from edc1624 to bba8404 Compare November 8, 2024 20:06

pre-commit-ci bot and others added 2 commits November 8, 2024 20:07

[pre-commit.ci] auto fixes from pre-commit.com hooks

51d2784

for more information, see https://pre-commit.ci

Merge branch 'main' into docsum

6a87761

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add support for Audio and Video summarization to Docsum #865

Add support for Audio and Video summarization to Docsum #865

MSCetin37 commented Nov 7, 2024 •

edited

Loading

codecov bot commented Nov 8, 2024 •

edited

Loading

ashahba left a comment

Add support for Audio and Video summarization to Docsum #865

Are you sure you want to change the base?

Add support for Audio and Video summarization to Docsum #865

Conversation

MSCetin37 commented Nov 7, 2024 • edited Loading

Description

Issues

Type of change

Tests

Whisper Service

Audio2Text Service

Video2Audio Service

Multimedia2Text Service

codecov bot commented Nov 8, 2024 • edited Loading

Codecov Report

ashahba left a comment

Choose a reason for hiding this comment

MSCetin37 commented Nov 7, 2024 •

edited

Loading

codecov bot commented Nov 8, 2024 •

edited

Loading