Skip to content

Commit 0c49890

Browse files
committed
Minor edits for typos in documentation.
1 parent 8d5870d commit 0c49890

File tree

5 files changed

+17
-6
lines changed

5 files changed

+17
-6
lines changed

DOCUMENTATION.md

+11
Original file line numberDiff line numberDiff line change
@@ -28,6 +28,9 @@ key component that can help in that task. Additionally, if you need to change
2828
the underlying data that is being matched for example by adding a new
2929
country to the standardizer or adding an new reference set, then this document
3030
will help explain the how it works and what you can do to make those changes.
31+
If you work with PostgreSQl database this provides and extension to the database
32+
and SQl wrappers for accessing this code. Alternatively for developer you can
33+
link this code directly into your C/C++ application and access it from there.
3134

3235
## How Does It Work?
3336

@@ -627,4 +630,12 @@ global changes (ie: ones related to may addresses or a class of addresses)
627630
making adjustments to the grammar is better. I typically look at 1,000s if not
628631
1,000,000s of records when building a new Lexicon and Grammar.
629632

633+
## See Also
634+
635+
* [Overview](docs/overview.md)
636+
* [Token](docs/token.md)
637+
* [Lexicon](docs/lexicon.md)
638+
* [Grammar](docs/grammar.md)
639+
* [InClass::Type](docs/inclass.md)
640+
* [OutClass::Type](docs/outclass.md)
630641

docs/grammar.md

+3-3
Original file line numberDiff line numberDiff line change
@@ -22,10 +22,10 @@ need to account for things like optional fields, maybe apartment numbers, etc.
2222

2323
The first example might match many addresses in the US and Canada. The USPS has
2424
a document Pub. 28 that defines valid address structure and acceptable
25-
abbreviations the can be used to build the Lexicon.
25+
abbreviations that can be used to build the Lexicon.
2626

2727
The second example might be used at the top level to match some European
28-
country addresses where the house number follows the streat name.
28+
country addresses where the house number follows the street name.
2929

3030
The third example might be used by someone that is only working with a small
3131
subset of data covering a single city where there is no need to differentiate
@@ -59,7 +59,7 @@ tokens.
5959
The second example ``[house_number]`` has a different format and here we have
6060
to consume input tokens, ie: the tokens before the first '->'.
6161

62-
The ``[ADDRESS]`` style section is referred to a *MetaSection* and the
62+
The ``[ADDRESS]`` style section is referred to as a *MetaSection* and the
6363
``[house_number]`` section is referred to as a *RuleSection*. You can not mix
6464
*MetaSection* lines with *RuleSection* lines. In the *MetaSection* in the
6565
grammar file all references are preceded by an '@' character, so the example

docs/inclass.md

+1-1
Original file line numberDiff line numberDiff line change
@@ -62,7 +62,7 @@ The list of input types is:
6262
| ATT_PRE | Attached prefix |
6363
| ATT_SUF | Attached suffix |
6464

65-
You can use *DETACHED* or leave the field empty to indicate that the the token is not attached. *DET_PRE*, *DET_SUF*, *ATT_PRE*, *ATT_SUF* can be placed in a comma separated list ass appropriate to indicate the possible attachments for the token. For example *STRASSE* could have *DET_SUF,ATT_SUF* associated with it to indcate that it follows the street name and might be attached or not.
65+
You can use *DETACHED* or leave the field empty to indicate that the the token is not attached. *DET_PRE*, *DET_SUF*, *ATT_PRE*, *ATT_SUF* can be placed in a comma separated list as appropriate to indicate the possible attachments for the token. For example *STRASSE* could have *DET_SUF,ATT_SUF* associated with it to indcate that it follows the street name and might be attached or not.
6666

6767

6868
## See Also

docs/outclass.md

+1-1
Original file line numberDiff line numberDiff line change
@@ -18,7 +18,7 @@ The full list of OutClass types is listed in the following table.
1818
| ------------- | ----------- |
1919
| BLDNG | Unparsed building identifiers and types. |
2020
| HOUSE | The civic address number. The '123' in '123 MAIN ST' |
21-
| PREDIR | A directional modifier that precedes the street name. Example, the 'WEST' in '3715 TENTH AVENUE WEST'. |
21+
| PREDIR | A directional modifier that precedes the street name. Example, the 'WEST' in '3715 WEST TENTH AVENUE'. |
2222
| QUALIF | A pre or post modifier. Example, the 'OLD' in '3715 OLD HIGHWAY 99' or 'BUSINESS' in '1203 HIGHWAY 45 BUSINESS'. |
2323
| PRETYP | A street type preceding the root street name. Example, the 'HIGHWAY' in '3715 HIGHWAY 99'. |
2424
| STREET | The root street name, stripped of directional or type modifiers. Example, the 'TENTH' in '3715 WEST TENTH AVENUE'. |

docs/token.md

+1-1
Original file line numberDiff line numberDiff line change
@@ -11,7 +11,7 @@ example, *ST* can be classified as *TYPE* in the context of *STREET* or as
1111
against the grammar are we able to determine which context is most likely to be
1212
the context of usage for this particular address.
1313

14-
The Token object as the following fields:
14+
The Token object has the following fields:
1515

1616
* WORD - a word or phrase parsed from the address string
1717
* STDWORD - a standardized version of *WORD* based on the Lexicon or *WORD*

0 commit comments

Comments
 (0)