Organizing Portuguese Legal Documents through Topic Discovery

<p>A significant challenge in the legal domain is to organize and summarize a constantly growing collection of legal documents, uncovering hidden topics, or themes, that later can support tasks such as legal case retrieval and legal judgment prediction. This massive amount of digital legal documents, combined with the inherent complexity of judiciary systems worldwide, presents a promising scenario for Machine Learning solutions, mainly those taking advantage of all the advancements in the area of Natural Language Processing (NLP).</p> <p>Brazil, a Portuguese speaking country, has a very large and complex judiciary system with over&nbsp;<a href="https://www.cnj.jus.br/pesquisas-judiciarias/justica-em-numeros/" rel="noopener ugc nofollow" target="_blank">27,7 million new legal cases in the year of 2021</a>. It is in this scenario that&nbsp;<a href="https://sobre.jusbrasil.com.br/" rel="noopener ugc nofollow" target="_blank">Jusbrasil</a>, the largest legal tech company in Brazil, with around 2 million accesses per day and a collection composed of billions of legal documents, operates. Besides the ever growing corpus, an additional challenge when dealing with documents from the legal domain is the complexity and uniqueness of the legal language.</p> <p><a href="https://medium.com/jusbrasil-tech/organizing-portuguese-legal-documents-through-topic-discovery-65384b37b92a"><strong>Read More</strong></a></p>