Commit 6dca83c

more fixes
1 parent 0187a25 commit 6dca83c

2 files changed: +8 -2 lines changed

batchalign/pipelines/morphosyntax/ud.py

Lines changed: 7 additions & 1 deletion
@@ -769,11 +769,17 @@ def morphoanalyze(doc: Document, retokenize:bool, skipmultilang:bool, status_hoo
         else:
             config["processors"]["mwt"] = "default"

+    if "ja" in lang:
+        config["processors"]["tokenize"] = "combined"
+        config["processors"]["pos"] = "combined"
+        config["processors"]["lemma"] = "combined"
+        config["processors"]["depparse"] = "combined"
+
+
     configs = {}
     for l in lang:
         configs[l] = config.copy()

-
     if len(lang) > 1:
         nlp = stanza.MultilingualPipeline(
             lang_configs = configs,
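
Note on the hunk above: the "combined" override is written into `config` before the per-language loop, and `config.copy()` is a shallow copy, so every entry in `configs` shares the same nested "processors" dict. When `lang` holds more than one language, the Japanese packages therefore apply to all of them, which may or may not be intended. A minimal sketch of a per-language alternative follows, assuming a hypothetical helper `build_lang_configs` and an illustrative base config (neither appears in this commit):

```python
import copy

# Processors this commit switches to the "combined" package for Japanese.
JA_COMBINED = ("tokenize", "pos", "lemma", "depparse")


def build_lang_configs(lang, base_config):
    """Hypothetical helper: build one independent config per language code."""
    configs = {}
    for l in lang:
        # deepcopy so the nested "processors" dict is not shared across languages
        cfg = copy.deepcopy(base_config)
        if l == "ja":
            for proc in JA_COMBINED:
                cfg["processors"][proc] = "combined"
        configs[l] = cfg
    return configs


# Illustrative base config; the real one in ud.py is not shown in this diff.
base = {"processors": {"tokenize": "default", "pos": "default",
                       "lemma": "default", "depparse": "default"}}
configs = build_lang_configs(["ja", "en"], base)
```

With this shape, only the "ja" entry carries the "combined" packages and the other languages keep whatever the base config specifies.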

batchalign/version

Lines changed: 1 addition & 1 deletion
@@ -1,3 +1,3 @@
-0.7.20-post.15
+0.7.20-post.16
 August 05, 2025
 some ud parsing fixes
