a tool for text analysis


TACL is a tool for performing basic text analysis on a corpus of texts developed by Michael Radich and Jamie Norrish.

TACL can, with minor modifications, be used for any texts, though it is designed specifically for the texts available from the Chinese Buddhist Electronic Text Association (CBETA).

The basis of the analysis it enables is to divide up the corpus texts into their consistuent n-grams, and allow querying for the differences and intersections of these n-grams between arbitrary groupings of texts.