Skip to content

3. Terminology

We define a text-document as a digital piece of written matter that provides information and is saved in one or multiple files. Example of documents are newspaper, books, book titles or articles. An entire document of one or more files can be partitioned in smaller pieces of text that we denote as segments. Example of segments are lines, sentences, paragraphs, sections among others. We use the term hit to denote that a candidate-term is found for a query-term.