Replies: 1 comment 2 replies
-
There is a different between letter-based and syllable/word-based scripts here. In letter-based scripts the spaces are usually significant and will therefore not be left out. For a native writer, they are much more significant then upper vs. lower case. So with your example '5900 Alderbridge Way' people will regularly query for '5900 alderbridge way' but never for '5900AlderbridgeWay'. With that in mind, Nominatim ignores upper vs. lower case but considers spaces mandatory. This mostly works well with two exceptions: we do have special treatment for houenumbers where spaces are used more liberally, and there is a kind of marketing speech emerging which favours CamelCase over using spaces (see #3578). Your suggestion might have merit for the latter. However, the preprocessor would probably not be the base place to handle this. The preprocessor can only make hard this-or-that decisions: given 'AudioDesign', search either for 'AudioDesign' or for 'Audio-Design'. What we'd might want to do is searching for both. |
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
-
Hi @lonvia, it is my first time to touch with osm from GSOC and I am trying to play with it! I searched some addresses in vancouver and found something does not work as I expect, for example I searched '5900AlderbridgeWay', there was showing with no result. And if I searched '5900 Alderbridge Way', the correct address was showed up. I thought the reason is the preprocessor did not parse the query correctly. Like it did not insert spaces between numbers and letters and it also did not automatically separate house number, street name, and street type (Way, St, Ave). If you correctly do in this way, you can get the correct address showed up on the map. I thought this problem is relevant with the latest issue 'Add query preprocessor for using regular expressions on queries'. I can add a space between the number and the letter, and add a space between street name and its type(the rule in query-processing should be like pattern: ([a-zA-Z]+)(Way|St|Street|Ave|Avenue|Blvd|Boulevard|Rd|Road) replace: \1 \2 .
In japanese preprocessor, it played well! It can efficiently recognize like 都都道府県縣 and 市区區町村. So if you search 東京都新宿区西新宿, after processing, it will be 東京都:新宿区:西新宿. So the map can recognize it and show the correct address up on the map.
It is my small thought after playing with the osm and I am really into the map stuff. I like it! Thank you for guidance and time! I appreciate so much!
Beta Was this translation helpful? Give feedback.
All reactions