I am now at the second part of the trinity mentioned earlier in this article: the encoding. This part of the work was already partly explained in some of the first posts, but those mainly covered the content of the XML tree that will be used for each letter of the corpus. This…
