Recognizing Underlying Patterns in Categorical Data via Symbolization and Masking Mechanisms

- Depending on your transformer toolkit versions, the transformer import code may need to be adjusted, like as follows:
+ from transformers.modeling_bert import BertPreTrainedModel, BertPooler
+ --> from transformers.models.bert.modeling_bert import BertPreTrainedModel, BertPooler
- (Please check your transformer toolikt, and update the import code accordingly.)

How to run the code?

After downloading the code, you can run

python3 run.py

directly for categorical clustering. We suggest adjusting the hyperparameters multiple times to achieve better results.

What are the scripts used for?

(1)LM/BertForMaskedLM: Contains the model structure and configuration of the BERT.

(2)make_dataset: Data processing. Help us prepare the training set.

(3)models: Define the network structure of SAMM.

(4) utils: Contains functions for data processing and model evaluation.

Several toolkits may be needed to run the code

(1) pytorch (https://anaconda.org/pytorch/pytorch)

(2) sklearn (https://anaconda.org/anaconda/scikit-learn)

(3) transformers (https://anaconda.org/conda-forge/transformers)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Recognizing Underlying Patterns in Categorical Data via Symbolization and Masking Mechanisms

How to run the code?

What are the scripts used for?

Several toolkits may be needed to run the code

About

Uh oh!

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 19 Commits
LM/BertForMaskedLM		LM/BertForMaskedLM
make_dataset		make_dataset
model		model
utils		utils
README.md		README.md
datas.zip		datas.zip
run.py		run.py

kcisgroup/SAMM

Folders and files

Latest commit

History

Repository files navigation

Recognizing Underlying Patterns in Categorical Data via Symbolization and Masking Mechanisms

How to run the code?

What are the scripts used for?

Several toolkits may be needed to run the code

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages