diff --git a/README.md b/README.md
index 3076250..e82a622 100644
--- a/README.md
+++ b/README.md
@@ -6,9 +6,9 @@ This repo consists of a source code of a Python script which detects plagiarism
 
 ## How is it Done?
 
-You might be wondering how plagiarism detection on textual data is done, well it ain't as complicated as you may think.
+You might be wondering how plagiarism detection on textual data is done. Well, it ain't as complicated as you may think.
 
-We all know that computers are good with numbers; so in order to compute the similarity between two text documents, the textual raw data is transformed into vectors => arrays of numbers and from that, we make use of basic knowledge of vectors to compute the similarity between them.
+We all know that computers are good with numbers, so in order to compute the similarity between two text documents, the textual raw data is transformed into vectors => arrays of numbers and from that, we make use of basic knowledge of vectors to compute the similarity between them.
 
 This repo contains a basic example on how to do that.
 
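
The vector idea described in the changed README paragraph can be sketched in a few lines. This is a minimal illustration, not the repo's actual script: it assumes plain word counts as the vectors and cosine similarity as the comparison, and the names `vectorize`, `cosine_similarity`, and `similarity` are hypothetical helpers introduced only for this sketch.

```python
import math
import re
from collections import Counter


def vectorize(text):
    """Turn raw text into a sparse word-count vector (word -> count).

    Assumption: simple lowercase word counts stand in for whatever
    vectorization the real script uses.
    """
    return Counter(re.findall(r"[a-z0-9']+", text.lower()))


def cosine_similarity(a, b):
    """Cosine of the angle between two sparse count vectors."""
    # Only words present in both vectors contribute to the dot product;
    # Counter returns 0 for a missing word.
    dot = sum(count * b[word] for word, count in a.items())
    norm_a = math.sqrt(sum(c * c for c in a.values()))
    norm_b = math.sqrt(sum(c * c for c in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # an empty document is treated as similar to nothing
    return dot / (norm_a * norm_b)


def similarity(text_a, text_b):
    """Similarity score between two raw text documents."""
    return cosine_similarity(vectorize(text_a), vectorize(text_b))
```

Calling `similarity(doc_a, doc_b)` on two strings gives a score between 0 and 1: documents with no words in common score 0, and documents with the same word counts score approximately 1. Word order is ignored in this sketch, so it only measures vocabulary overlap.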