Skip to content

sidorchukandrew/Custom-Wiki-BTree

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Custom-Wiki-BTree

This repository contains two programs, the loader and the application. The loader reads 5 Wiki sites from a text filea and crawls to 19 more from each. A BTree was implemented from the Intro to Algorithms: Chapter 19 to collect wrods from each Wiki site and their frequencies. Once the BTree for each site is created, it's cached and stored persistently. Medoids are created by swapping from our original 5 root Wiki pages 1000 times and also stored persistently. The application part of the project reads all the BTrees into main memory (100 isn't actually that many), allows the user to select a medoid, see what the associated websites are (these were computed in the loader program) and enter a Wiki page of their choice to see which cluster it belongs to.

Assignment 2 Demo

Helpful links

Pseudocode for BTree : http://staff.ustc.edu.cn/~csli/graduate/algorithms/book6/chap19.htm
KMedoid Clustering : https://www.geeksforgeeks.org/ml-k-medoids-clustering-with-example/

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published