Skip to content

Commit 62c95d0

Browse files
committed
Add info about dataset miners
1 parent 42b6c8a commit 62c95d0

File tree

1 file changed

+8
-0
lines changed

1 file changed

+8
-0
lines changed

README.md

+8
Original file line numberDiff line numberDiff line change
@@ -14,6 +14,14 @@ You can easily install model through the PIP:
1414
pip install code2seq
1515
```
1616

17+
## Dataset mining
18+
19+
To prepare your own dataset with a storage format supported by this implementation, use on the following:
20+
1. Original dataset preprocessing from vanilla repository
21+
2. [`astminer`](https://github.com/JetBrains-Research/astminer):
22+
the tool for mining path-based representation and more with multiple language support.
23+
3. [`PSIMiner`](https://github.com/JetBrains-Research/psiminer):
24+
the tool for extracting PSI trees from IntelliJ Platform and creating datasets from them.
1725
## Available checkpoints
1826

1927
### Method name prediction

0 commit comments

Comments
 (0)