📌 A deep learning-based personality classification model using LSTMs, Bi-Directional LSTMs, and BERT to predict MBTI personality types from text data.

This project classifies MBTI personality types from text data and is structured into three key steps:
- **Data Visualization & Preprocessing** → Cleaning and preparing the text data.
- **Model Training** → Training LSTM, Bi-Directional LSTM, and BERT models.
- **Model Evaluation** → Comparing model accuracy, loss, and overall performance.
✅ **LSTM Model** → Sequential model with embeddings and LSTM layers
✅ **Bi-Directional LSTM Model** → Enhances sequence learning with bidirectional LSTMs
✅ **BERT Model** → Transformer-based NLP model for improved contextual understanding
✅ **Performance Comparison** → Evaluation of all models based on accuracy and loss
## Installation

```bash
git clone https://github.com/JaspreetSingh-exe/Personality-Prediction-Using-Deep-Learning.git
cd Personality-Prediction-Using-Deep-Learning
pip install -r requirements.txt
```
## 📊 Step 1: Data Visualization & Preprocessing

```bash
jupyter notebook data_visualization.ipynb
```

## 🏋️‍♂️ Step 2: Model Training

```bash
jupyter notebook training.ipynb
```

## 📈 Step 3: Model Evaluation & Comparison

```bash
jupyter notebook evaluate_model.ipynb
```
## Dataset

The dataset consists of text samples labeled with Myers-Briggs Type Indicator (MBTI) personality types. Each entry contains a series of posts written by a single user together with that user's personality type. The dataset is preprocessed by:
- Removing stopwords and special characters to clean text data.
- Tokenizing and padding sequences for uniform input.
- Splitting into training and testing sets for model evaluation.
These steps yield the text inputs used to train the models to classify personality types; a minimal preprocessing sketch is shown below.
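As an illustration of these steps, here is a minimal preprocessing sketch using Keras and scikit-learn utilities; the column names `posts` and `type` are assumptions for illustration and may differ from the actual notebook:

```python
import re
import pandas as pd
from sklearn.feature_extraction.text import ENGLISH_STOP_WORDS
from sklearn.model_selection import train_test_split
from tensorflow.keras.preprocessing.text import Tokenizer
from tensorflow.keras.preprocessing.sequence import pad_sequences

# Load the dataset (column names are assumed for illustration)
df = pd.read_csv("cleaned_mbti_data.csv")

# Remove special characters and stopwords
df["posts"] = (df["posts"].str.lower()
               .apply(lambda t: re.sub(r"[^a-z\s]", " ", t))
               .apply(lambda t: " ".join(w for w in t.split() if w not in ENGLISH_STOP_WORDS)))

# Tokenize and pad so every sample has the same length (matches the models' input_length)
tokenizer = Tokenizer(num_words=10000)
tokenizer.fit_on_texts(df["posts"])
X = pad_sequences(tokenizer.texts_to_sequences(df["posts"]), maxlen=1500)

# Encode the 16 MBTI types as integers and split into train/test sets
y = df["type"].astype("category").cat.codes.values
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)
```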
```
📦 Personality Prediction Using Deep Learning
├── data_visualization.ipynb       # Exploratory data analysis & preprocessing
├── training.ipynb                 # Model training (LSTM, Bi-LSTM, BERT)
├── evaluate_model.ipynb           # Model evaluation & comparison
├── cleaned_mbti_data.csv          # Preprocessed dataset
├── README.md                      # Project documentation
├── requirements.txt               # Dependencies list
├── model_comparison_results.csv   # Performance metrics
└── models/
    ├── lstm_model.h5              # Trained LSTM model
    ├── bilstm_model.h5            # Trained Bi-LSTM model
    └── bert_model.h5              # Trained BERT model
```
## LSTM Model

LSTM (Long Short-Term Memory) is a type of Recurrent Neural Network (RNN) that is well suited to processing sequential data such as text.
```python
from keras.models import Sequential
from keras.layers import LSTM, Embedding, Dense

model = Sequential([
    # Map the 10,000 most frequent tokens to 256-dimensional vectors
    Embedding(input_dim=10000, output_dim=256, input_length=1500),
    # Single LSTM layer; dropout regularizes inputs and recurrent connections
    LSTM(100, dropout=0.2, recurrent_dropout=0.2),
    # One softmax output per MBTI type (16 classes)
    Dense(16, activation='softmax')
])

model.compile(loss='categorical_crossentropy', optimizer='adam', metrics=['accuracy'])
```
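Training is not shown above; as a minimal sketch, assuming `X_train`/`y_train` come from a preprocessing step like the one sketched earlier, `categorical_crossentropy` requires one-hot encoded labels:

```python
from tensorflow.keras.utils import to_categorical

# One-hot encode the 16 integer MBTI class labels for categorical_crossentropy
y_train_oh = to_categorical(y_train, num_classes=16)
y_test_oh = to_categorical(y_test, num_classes=16)

# Hyperparameters here (epochs, batch size) are illustrative, not the project's settings
history = model.fit(X_train, y_train_oh,
                    validation_data=(X_test, y_test_oh),
                    epochs=5, batch_size=64)
```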
📄 Paper Link
## Bi-Directional LSTM Model

A Bi-Directional LSTM (Bi-LSTM) processes input sequences both forward and backward, improving how much context the model captures.
```python
from keras.models import Sequential
from keras.layers import LSTM, Embedding, Dense, Dropout, Bidirectional

model = Sequential([
    Embedding(input_dim=10000, output_dim=256, input_length=1500),
    # First Bi-LSTM returns the full sequence so a second recurrent layer can follow
    Bidirectional(LSTM(100, return_sequences=True)),
    Dropout(0.3),
    # Second Bi-LSTM collapses the sequence into a single vector
    Bidirectional(LSTM(50)),
    Dense(16, activation='softmax')
])

model.compile(loss='categorical_crossentropy', optimizer='adam', metrics=['accuracy'])
```
📄 Paper Link
## BERT Model

BERT is a transformer-based NLP model pretrained on large corpora that captures context from both the left and the right of each token.
```python
import tensorflow as tf
from transformers import TFBertModel, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
bert_layer = TFBertModel.from_pretrained("bert-base-uncased")

# NOTE: bert-base-uncased accepts at most 512 tokens, so the input length is
# capped at 512 here (1500 would exceed BERT's position embeddings)
input_word_ids = tf.keras.layers.Input(shape=(512,), dtype=tf.int32, name="input_word_ids")

# [0] is the last hidden state; [:, 0, :] selects the [CLS] token representation
bert_outputs = bert_layer(input_word_ids)[0]
output = tf.keras.layers.Dense(16, activation="softmax")(bert_outputs[:, 0, :])

bert_model = tf.keras.models.Model(inputs=input_word_ids, outputs=output)
bert_model.compile(loss="categorical_crossentropy",
                   optimizer=tf.keras.optimizers.Adam(learning_rate=1e-5),
                   metrics=["accuracy"])
```
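The snippet above creates a tokenizer but never uses it; as a minimal, hedged sketch (the example texts and variable names are illustrative, not from the original notebooks), raw posts could be converted into `input_word_ids` like this:

```python
# Tokenize raw posts into fixed-length token-ID tensors for the model above
texts = ["I love quiet evenings with a good book.",
         "Let's get everyone together this weekend!"]

encoded = tokenizer(texts,
                    padding="max_length",
                    truncation=True,
                    max_length=512,
                    return_tensors="tf")

# The model defined above takes only the token IDs as input
probs = bert_model.predict(encoded["input_ids"])
print(probs.shape)  # (2, 16): one probability per MBTI type for each post
```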
📄 Paper Link
## Performance Comparison

| Model | Accuracy |
|---|---|
| LSTM | 25.4 % |
| Bi-Directional LSTM | 53.0 % |
| BERT | 85.8 % |
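The same metrics are saved to `model_comparison_results.csv`; a small sketch for inspecting them (the column names `model` and `accuracy` are assumptions):

```python
import pandas as pd

# Load the saved performance metrics and rank the models by accuracy
results = pd.read_csv("model_comparison_results.csv")
print(results.sort_values("accuracy", ascending=False))
```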
Want to improve this project? Contributions are welcome!
- Fork the repo
- Create a new branch
- Submit a pull request
This project is licensed under the MIT License.
For queries, reach out to: ✉️ jaspreetsingh01110@gmail.com