|
| 1 | +# **Pet's Facial Expression Detection** |
| 2 | + |
| 3 | +### 🎯 Goal |
| 4 | +This project aims to develop an advanced image classification system that can accurately identify the facial expressions of pets using cutting-edge deep learning techniques. |
| 5 | + |
| 6 | +### Purpose |
| 7 | +Understanding pet emotions through their facial expressions can significantly enhance the bond between pets and their owners. This project aims to leverage advanced AI models to detect and classify various facial expressions in pets, aiding in better care and understanding of pets' emotional states. |
| 8 | + |
| 9 | +### 🧵 Dataset |
| 10 | +The dataset used in this project is: https://www.kaggle.com/datasets/anshtanwar/pets-facial-expression-dataset |
| 11 | + |
| 12 | +### 🧾 Description |
| 13 | +This project focuses on developing a sophisticated image classification system to identify different facial expressions in pets using deep learning techniques. The system leverages three state-of-the-art convolutional neural network (CNN) architectures: EfficientNet and ResNet50, and Vision Transformer (ViT) architecture, each known for their robustness and high performance in image recognition tasks. |
| 14 | + |
| 15 | +### 🚀 Models Implemented |
| 16 | +- ViT Transformer: Utilizes the transformer architecture for image classification, providing state-of-the-art accuracy. |
| 17 | +- EfficientNet: A family of models that balance model depth, width, and resolution, achieving high performance with fewer parameters. |
| 18 | +- ResNet50: Employs residual learning to train very deep networks, addressing the vanishing gradient problem and achieving high accuracy. |
| 19 | + |
| 20 | +### 📚 Libraries Needed |
| 21 | +- TensorFlow: For building and training deep learning models. |
| 22 | +- Keras: For simplifying the creation and training of neural networks. |
| 23 | +- NumPy: For numerical computations and array operations. |
| 24 | +- Pandas: For data manipulation and analysis. |
| 25 | +- Matplotlib: For plotting and visualizing data. |
| 26 | + |
| 27 | +### 📊 Exploratory Data Analysis Results |
| 28 | +Exploratory Data Analysis (EDA) involved examining the distribution of the dataset, visualizing sample images, and understanding the different classes of facial expressions. The dataset was split into training and testing sets to evaluate the model's performance effectively. |
| 29 | + |
| 30 | +### 📈 Performance of the Models based on the Accuracy Scores |
| 31 | + |
| 32 | +**ViT Performance** |
| 33 | + |
| 34 | + |
| 35 | + |
| 36 | +**EficientNet Performance** |
| 37 | + |
| 38 | + |
| 39 | + |
| 40 | +**ResNet50 Performance** |
| 41 | + |
| 42 | + |
| 43 | + |
| 44 | + |
| 45 | +### 📢 Conclusion |
| 46 | +The Vision Transformer (ViT) model successfully classified the facial expressions of pets with high accuracy. The EfficientNet model showed moderate performance, and the ResNet50 model achieved the highest accuracy among the three models. Each model's performance demonstrates the potential of using advanced neural network architectures for image classification tasks in the domain of pet emotion detection. |
| 47 | + |
| 48 | +### Accuracy Results |
| 49 | +The model's performance was evaluated using accuracy and F1 scores. |
| 50 | +1. ViT |
| 51 | + - Training Accuracy: 80.0% |
| 52 | + - F1 Score: 0.8 |
| 53 | +2. EfficientNet |
| 54 | + - Training Accuracy: 57.7% |
| 55 | + - F1 Score: 0.54 |
| 56 | +3. ResNet50 |
| 57 | + - Training Accuracy: 97.99% |
| 58 | + |
| 59 | +### Best Fitted Model |
| 60 | +The ResNet50 model proved to be the best-fitted model for this task, achieving the highest accuracy and demonstrating robustness in classification. |
| 61 | + |
| 62 | +## ✒️ Contributor |
| 63 | +- Name: Khushi Kalra |
| 64 | +- Github: https://www.github.com/abckhush |
0 commit comments