Skip to content

To predict the letter/number of a given handwritten alphanumeric character using pixel data; Skills Used: Neural Networks, SVM, kNN, Cluster Analysis, Exploratory Analysis

Notifications You must be signed in to change notification settings

wisamb/OfflineCharacterRecognition

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

10 Commits
 
 
 
 

Repository files navigation

Offline Character Recognition System

Programming Languages/Software: R, RStudio

Skills Used:
Neural Networks
SVM
kNN
Cluster Analysis
Exploratory Analysis

Introduction

Handwriting is unique to every individual. The ability to systematically recognize letters that are handwritten has made great strides; however, there still remains a level of complexity due to the individuality of how each one of us writes a letter or word. This project uses machine learning and cluster analysis to recognize handwritten alphanumeric characters.

Exploratory Analysis

The data contains 11,000 rows and 3,136 predictor variables (Pixel.1 to Pixel.3136), all of which were binary variables containing only 0 or 1. When consumed as a unit, these predictor variables form a 56 x 56 image of the represented digit. There are no missing values in this dataset.

Sample of plotted letters and numbers are provided below.

Looking at the distribution of each character, there is little variability on average in how people draw their characters. 1's and K's seem to have the widest distributions while 9's and J's have the smallest. Furthermore, there are few outliers in this dataset and some characters have no outliers at all.

Model Analysis

Model Test Error Rate (%)
kNN 31.3
NNET 33.9
SVM 20.0

About

To predict the letter/number of a given handwritten alphanumeric character using pixel data; Skills Used: Neural Networks, SVM, kNN, Cluster Analysis, Exploratory Analysis

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published