-
-
Notifications
You must be signed in to change notification settings - Fork 0
Update dependency unstructured to v0.18.9 #410
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: master
Are you sure you want to change the base?
Conversation
Important Review skippedBot user detected. To trigger a single review, invoke the You can disable this status message by setting the 🪧 TipsChatThere are 3 ways to chat with CodeRabbit:
SupportNeed help? Join our Discord community for assistance with any issues or questions. Note: Be mindful of the bot's finite context window. It's strongly recommended to break down tasks such as reading entire modules into smaller chunks. For a focused discussion, use review comments to chat about specific files and their changes, instead of using the PR comments. CodeRabbit Commands (Invoked using PR comments)
Other keywords and placeholders
CodeRabbit Configuration File (
|
4505261
to
56bf381
Compare
56bf381
to
284c623
Compare
0a9ec4e
to
93b3ecd
Compare
93b3ecd
to
f61e586
Compare
f61e586
to
bfc6856
Compare
This PR contains the following updates:
==0.17.2
->==0.18.9
Release Notes
Unstructured-IO/unstructured (unstructured)
v0.18.9
Compare Source
Enhancements
Features
Fixes
v0.18.7
Compare Source
Enhancements
text_as_html
for Table element now keeps bothinput
andimg
tag'sclass
attribute Previously in partition HTML any tag inside a table is stripped of itsclass
attribute. Now this attribute is preserved for bothinput
andimg
tag in the table element'smetadata.text_as_html
.Features
Fixes
v0.18.6
Compare Source
Enhancements
Features
Fixes
TableChunk
for the string value of the fieldtype
when serializing elements of typeTableChunk
, rather than using the valueTable
.v0.18.5
Compare Source
Enhancements
text_as_html
for Table element now keepsimg
tag'sclass
attribute Previously in partition HTML any tag inside a table is stripped of itsclass
attribute. Now this attribute is preserved forimg
tag in the table element'smetadata.text_as_html
.Features
Fixes
v0.18.3
Compare Source
Enhancements
Features
Fixes
v0.18.2
Compare Source
Enhancements
Features
Fixes
tc_at_grid_offset
and raisedValueError: no tc element at grid_offset=X
.partition_md
reads the file as utf-8 previously. Now it usesread_txt_file
that reads file with detected encoding.UncategorizedText
or as the nested structure likeTitle
. Now they are properly partitioned asHeader
andFooter
element types.v0.18.1
Compare Source
Enhancements
Features
Fixes
encoding
property of the_CsvPartitioningContext
is now properly used.Configuration
📅 Schedule: Branch creation - At any time (no schedule defined), Automerge - At any time (no schedule defined).
🚦 Automerge: Disabled by config. Please merge this manually once you are satisfied.
♻ Rebasing: Whenever PR becomes conflicted, or you tick the rebase/retry checkbox.
🔕 Ignore: Close this PR and you won't be reminded about this update again.
This PR was generated by Mend Renovate. View the repository job log.