The structure is probably not in the training data for the Czech Republic. This ordering works and is closer to the addresses the model trains on:
"HVB BANK CZECH REPUBLIC A.S. Namesti Republicky 2090/3a PRAHA CZECH REPUBLIC"
You can programmatically check whether you get, for instance, two different city tokens or a company suffix labeled as a city, and then try rearranging and reparsing. If the pattern happens often, you can just concatenate "a.s." onto the previous "house" string whenever it is encountered as a "city". Model parses are never going to be perfect in every case, so it is often good to encode some guardrails, or to forward the few parses that seem to have logical inconsistencies (like two different city tokens) for manual review.
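A minimal sketch of such a guardrail in Python, assuming the parse arrives as the list of label/value objects returned by the REST endpoint in this issue. The names `COMPANY_SUFFIXES`, `merge_suffix_cities`, and `needs_review` are hypothetical helpers for illustration, not part of libpostal, and the suffix list would need to be extended for real data:

```python
# Hypothetical post-processing guardrail for parser output.
# COMPANY_SUFFIXES is an illustrative list, not something libpostal provides.
COMPANY_SUFFIXES = {"a.s.", "s.r.o."}


def merge_suffix_cities(parse):
    """Append company suffixes mislabeled as "city" onto the preceding "house"."""
    fixed = []
    for comp in parse:
        label, value = comp["label"], comp["value"]
        is_suffix = label == "city" and value.lower() in COMPANY_SUFFIXES
        if is_suffix and fixed and fixed[-1]["label"] == "house":
            # Fold the suffix back into the company name it belongs to.
            merged = fixed[-1]["value"] + " " + value
            fixed[-1] = {"label": "house", "value": merged}
        else:
            fixed.append({"label": label, "value": value})
    return fixed


def needs_review(parse):
    """Flag parses that contain two or more different city values."""
    cities = {c["value"] for c in parse if c["label"] == "city"}
    return len(cities) > 1


# The parse reported in this issue.
parse = [
    {"label": "road", "value": "namesti republicky"},
    {"label": "house_number", "value": "2090/3a"},
    {"label": "city", "value": "praha"},
    {"label": "house", "value": "hvb bank czech republic"},
    {"label": "city", "value": "a.s."},
    {"label": "country", "value": "czech republic"},
]
fixed = merge_suffix_cities(parse)
```

Here the raw parse would be flagged by `needs_review`, while the merged one would not. Anything still flagged after the merge step could be forwarded for manual review rather than trusted.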
I am using the Docker image https://hub.docker.com/r/clicksend/libpostal-rest
For Czech Republic address parsing, the abbreviation in the house name is wrongly parsed as "city":
curl -X POST -d '{"query": "Namesti Republicky 2090/3a PRAHA HVB BANK CZECH REPUBLIC A.S. CZECH REPUBLIC"}' localhost:8080/parser
[{"label":"road","value":"namesti republicky"},
{"label":"house_number","value":"2090/3a"},
{"label":"city","value":"praha"},
{"label":"house","value":"hvb bank czech republic"},
{"label":"city","value":"a.s."},
{"label":"country","value":"czech republic"}]