Fix incorrect PDF title display when multiple language titles present in XMP metadata (issue 20801) by nyxsky404 · Pull Request #20874 · mozilla/pdf.js

nyxsky404 · 2026-03-15T12:11:10Z

Fixes #20801

Problem

When a PDF has multiple language titles in XMP metadata using rdf:Alt, the titles were being concatenated instead of selecting a single title. For example, a PDF with both x-default and en language titles would display "Hello WorldHello World" instead of "Hello World".

Solution

Added _parseLangAlt method to properly handle rdf:Alt elements for dc:title and dc:description
Selects x-default language entry if present, otherwise uses the first entry
Falls back to textContent for backward compatibility with plain text values
Enabled hasAttributes option in SimpleXMLParser to read xml:lang attributes

Testing

Added unit tests for multiple language titles with x-default
Added unit tests for multiple language descriptions without x-default
All existing metadata tests continue to pass

… in XMP metadata (issue 20801)

codecov-commenter · 2026-03-15T13:35:25Z

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 62.56%. Comparing base (3127492) to head (7443fab).
⚠️ Report is 110 commits behind head on master.

Additional details and impacted files

@@            Coverage Diff             @@
##           master   #20874      +/-   ##
==========================================
+ Coverage   62.51%   62.56%   +0.04%     
==========================================
  Files         173      173              
  Lines      121246   121278      +32     
==========================================
+ Hits        75796    75872      +76     
+ Misses      45450    45406      -44

Flag	Coverage Δ
fonttest	`7.66% <ø> (?)`
unittestcli	`62.53% <100.00%> (+0.01%)`	⬆️

Flags with carried forward coverage won't be shown. Click here to find out more.

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.
📦 JS Bundle Analysis: Save yourself from yourself by tracking and limiting bundle sizes in JS merges.

themavik

Turning on hasAttributes for SimpleXMLParser and routing dc:title/dc:description through _parseLangAlt fixes the concatenated multi-lang titles cleanly. nit: _parseLangAlt grabs entry.childNodes[0] before _getSequence—if a whitespace text node sneaks ahead of rdf:Alt you might still fall back to odd textContent; worth a regression if you see real-world XMP like that.

nyxsky404 · 2026-03-23T08:22:13Z

Makes sense, I’ll add a regression test for that case

Fix incorrect PDF title display when multiple language titles present…

bca2dbd

… in XMP metadata (issue 20801)

timvandermeij added the core label Mar 15, 2026

nyxsky404 added 2 commits March 15, 2026 20:53

Merge branch 'mozilla:master' into issue20801

86df3a9

Add tests for empty metadata elements to improve coverage

7443fab

themavik reviewed Mar 23, 2026

View reviewed changes

Handle whitespace before rdf:Alt in metadata

f8b34e3

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix incorrect PDF title display when multiple language titles present in XMP metadata (issue 20801)#20874

Fix incorrect PDF title display when multiple language titles present in XMP metadata (issue 20801)#20874
nyxsky404 wants to merge 4 commits intomozilla:masterfrom
nyxsky404:issue20801

nyxsky404 commented Mar 15, 2026

Uh oh!

codecov-commenter commented Mar 15, 2026 •

edited

Loading

Uh oh!

themavik left a comment

Uh oh!

nyxsky404 commented Mar 23, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

nyxsky404 commented Mar 15, 2026

Problem

Solution

Testing

Uh oh!

codecov-commenter commented Mar 15, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

themavik left a comment

Choose a reason for hiding this comment

Uh oh!

nyxsky404 commented Mar 23, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

codecov-commenter commented Mar 15, 2026 •

edited

Loading