How can I annotate/caption the image and display it when exporting it to markdown or text file? #256

sunwoongc · 2024-11-06T08:55:26Z

Question

I want to add a caption to each PictureItem and display it in the markdown or text export, instead of the image itself or an image placeholder.
In this case, should I add an annotation to each PictureItem, as below:

from docling_core.types.doc.document import PictureItem, PictureDescriptionData

# conv_result is the conversion result obtained earlier (e.g. from DocumentConverter.convert)
picture_items = []
picture_count = 1
for item, level in conv_result.document.iterate_items():
    if isinstance(item, PictureItem):
        print("Picture")
        # Attach a description annotation to every picture found
        item.annotations.append(
            PictureDescriptionData(
                provenance="sample",
                text=f"This is a sample annotation for picture #{picture_count}"
            )
        )
        picture_count += 1
        picture_items.append(item)

Or should I implement a custom BaseEnrichmentModel, or a custom class that inherits from ImageRef?
I think I could also add a new mode to ImageRefMode, such as ImageRefMode.LLM.
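
For reference, once annotations are attached as in the loop above, their text can be read back per picture. This is a minimal sketch, assuming each picture exposes an `annotations` list whose entries carry a `text` attribute (as `PictureDescriptionData` does in the snippet above). `first_description` and `collect_descriptions` are hypothetical helpers, and the stand-in objects only mimic that shape so the example runs without docling installed:

```python
from types import SimpleNamespace
from typing import Iterable, List, Optional


def first_description(picture) -> Optional[str]:
    """Return the text of the first annotation with non-empty text, else None."""
    for annotation in getattr(picture, "annotations", []):
        text = getattr(annotation, "text", None)
        if text:
            return text
    return None


def collect_descriptions(pictures: Iterable) -> List[Optional[str]]:
    """One entry per picture, in the order given; None where nothing is attached."""
    return [first_description(picture) for picture in pictures]


# Stand-ins that mimic the assumed shape (these are not real docling objects).
described = SimpleNamespace(
    annotations=[SimpleNamespace(text="A bar chart of sales.")]
)
bare = SimpleNamespace(annotations=[])

print(collect_descriptions([described, bare]))
```

With real documents, the `picture_items` list built in the loop above could be passed to `collect_descriptions` in the same way.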

PeterStaar-IBM · 2024-11-06T09:14:30Z

@sunwoongc If I understand you correctly, you want to add a caption to the figure. In general, this is done as follows:

fig_caption = doc.add_text(
    label=DocItemLabel.CAPTION, text=("".join(texts)).strip()
)
doc.add_picture(
    parent=self.parents[self.level],
    caption=fig_caption,
)

You can also inspect the backends, e.g. here

Let me know if this helps you (and if so, feel free to close the issue).

dolfim-ibm · 2024-11-06T09:41:19Z

I think this request is similar to what we are planning in #192.

sunwoongc · 2024-11-07T00:21:59Z

Thanks for the kind and quick reply! I'm not quite sure how to use handle_figure just yet, but I'll give it a try. Thanks!

What I'm actually aiming to do is convert an image into descriptive text that represents the image's content. When exporting the result to markdown using .export_to_markdown(), I notice that the image is represented by a placeholder tag. Instead of this default tag, I'd like to substitute descriptive text, such as "This image represents ...".

By the way, I found that in the export_to_document_tokens method of PictureItem, there's a section of code that adds a caption to the body.

        if add_caption and len(self.captions):
            text = self.caption_text(doc)

However, I haven't found a way to initialize self.captions, or how to use the caption_text method.

PeterStaar-IBM · 2024-11-07T06:46:07Z

My proposal to you would be to make a class that inherits from DoclingDocument and write a custom export method, similar to export_to_markdown, that adapts this section
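
A lighter variant of this idea, short of subclassing, is to post-process the string that export_to_markdown() returns and swap each image placeholder for a description. The sketch below is plain string handling, not docling API. `replace_image_placeholders` is a hypothetical helper, and the placeholder value is an assumption that should be checked against what your installed version actually emits. It also assumes the descriptions are supplied in the same order as the pictures appear in the export:

```python
from typing import List, Optional

# Assumed placeholder string (not confirmed in this thread); verify it
# against the output of your own export before relying on it.
ASSUMED_PLACEHOLDER = "<!-- image -->"


def replace_image_placeholders(
    markdown: str,
    descriptions: List[Optional[str]],
    placeholder: str = ASSUMED_PLACEHOLDER,
) -> str:
    """Replace the n-th placeholder with the n-th description.

    Placeholders without a matching (non-empty) description are kept as-is.
    """
    parts = markdown.split(placeholder)
    out = [parts[0]]
    for index, tail in enumerate(parts[1:]):
        description = descriptions[index] if index < len(descriptions) else None
        out.append(description if description else placeholder)
        out.append(tail)
    return "".join(out)


exported = "Intro\n\n<!-- image -->\n\nMiddle\n\n<!-- image -->\n"
print(replace_image_placeholders(exported, ["This image represents a cat."]))
```

Keeping the substitution outside the document class avoids depending on the export method's signature, at the cost of relying on the placeholder text and on picture order matching the export.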

sunwoongc · 2024-11-07T07:07:37Z

> my proposal to you would be to make a class that inherits the DoclingDocument and write a custom export which is similar to export_to_markdown and adapts this section

Thanks, I'll try it.

PeterStaar-IBM · 2024-11-11T09:26:15Z

@sunwoongc We believe this is addressed by #192. In order to avoid duplicate issues, we will close this one and move to #192. Feel free to keep an eye on it.

sunwoongc added the "question" label (Further information is requested) on Nov 6, 2024

PeterStaar-IBM closed this as completed Nov 11, 2024
