Make script converting tokenized corpus contents to non-tokenized versions

Make standing XSLT script that converts tokenized corpus documents to non-tokenized ones and copying only contents

This can then be the basis for a 3rd main variety of every corpus document in which we will store and edit IGT glosses

The following can be the other outputs:

### 1) Basic structure with original Mixtec(non-tokenized), English and Spanish sentence translations

```
<seg xml:id="d1e140a" n="3" xml:lang="mix" resp="#TS" type="S">Nikitsi Shanty ka tsi mee ncha aueroperto S.F </seg>
<spanGrp type="annotations">
 Shanty came with me to the S.F airport.
 Shanty vino conmigo al aeropuerto de S.F.
</spanGrp>
```

This can then be copied (in a slightly modified XSLT) and (mostly) manually edited to become:

### 2) IGT centered data structure
```
<seg xml:id="d1e140igt" n="3" xml:lang="mix" resp="#TS" type="IGT">Ni-kits-i Shanty=ka tsi mee ncha aueroperto S.F.</seg>
<spanGrp type="annotations">
 PFV-come-3s Shanty=TPC with PRON-EMPH.1s ADPOS.until S.F airport 
 Shanty came with me to the S.F airport.
 Shanty vino conmigo al aeropuerto de S.F 
</spanGrp>
```
- Important note: I will need to create proper typology for the values of `//seg` to express that it is both the sentence (#S) and segmented as an interlinear glossed text (#IGT) and for the value of the `//span` that contains the interlinear glosses corresponding to that `//seg`

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Make script converting tokenized corpus contents to non-tokenized versions #112

1) Basic structure with original Mixtec(non-tokenized), English and Spanish sentence translations

2) IGT centered data structure

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Make script converting tokenized corpus contents to non-tokenized versions #112

Description

1) Basic structure with original Mixtec(non-tokenized), English and Spanish sentence translations

2) IGT centered data structure

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions