A Java application, built on the Lucene 8.0.0 API, that takes the .txt
file of a book and returns the book's title and date along with their offsets.
- An analyzer splits the file's text into tokens
- The first few tokens (up to 1000 characters in) are searched for labels such as `Title:` and `Date:`
- The text right after each label, along with its offsets, is printed to the screen
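
The label search described above can be sketched roughly as follows. This is only an illustration and not the project's actual code: it swaps the Lucene analyzer for a plain `java.util.regex` scan so that it runs without the Lucene jars, and the names `LabelScanSketch`, `Field`, `SCAN_LIMIT` and `scan` are hypothetical.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Illustrative stand-in for the label search; it does NOT use Lucene.
class LabelScanSketch {

    /** A label, the text that follows it, and that text's character offsets. */
    static class Field {
        final String label;
        final String value;
        final int start; // inclusive offset of the value
        final int end;   // exclusive offset of the value

        Field(String label, String value, int start, int end) {
            this.label = label;
            this.value = value;
            this.start = start;
            this.end = end;
        }
    }

    // Assumed limit, mirroring the "up to 1000 characters in" rule.
    static final int SCAN_LIMIT = 1000;

    // "Title:" or "Date:" followed by the rest of the line,
    // with surrounding spaces and tabs left out of the captured value.
    static final Pattern LABEL =
            Pattern.compile("\\b(Title|Date):[ \\t]*(.*?)[ \\t]*$", Pattern.MULTILINE);

    /** Scans only the first SCAN_LIMIT characters of the text for labels. */
    static List<Field> scan(String text) {
        String head = text.substring(0, Math.min(text.length(), SCAN_LIMIT));
        List<Field> found = new ArrayList<>();
        Matcher m = LABEL.matcher(head);
        while (m.find()) {
            found.add(new Field(m.group(1), m.group(2), m.start(2), m.end(2)));
        }
        return found;
    }

    public static void main(String[] args) {
        String sample = "Title: Moby Dick\nDate: 1851\n";
        for (Field f : scan(sample)) {
            System.out.println(f.label + " = " + f.value
                    + " [" + f.start + ", " + f.end + ")");
        }
    }
}
```

For the sample string in `main`, this prints `Title = Moby Dick [7, 16)` and `Date = 1851 [23, 27)`. In a Lucene-based version, the offsets would more likely come from each token's offset attribute than from a regular expression.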
java book_info_finder.BookInfoFinder <.txt file path>
The `.txt` book files are taken from Project Gutenberg.
The essential Lucene 8.0.0 API `.jar` files are included.