The IR-NLP Lab at the Faculty of Computer Science, Universitas Indonesia, focuses mainly on the research areas of Information Retrieval, Speech Processing, and Computational Linguistics, which form a basis or foundation for a broad range of applications such as Text Mining Applications, Natural Language Processing tools, Machine Translation, Question-Answering System, Digital Libraries, and Knowledge Management.
To become a leading research laboratory in Information Retrieval (IR) and Natural Language Processing (NLP) that advances ethical, impactful, and human-centered artificial intelligence for society.
Information Retrieval seeks to explore the methods and techniques of organizing, representing, storing, and searching of information in textual and multimedia forms (speech, image, and music).
In our lab, we have conducted several research topics (as well as published several papers) in the area of information retrieval:
Natural Language Processing is a field which tries to model natural language in formal rule representation, or formalism grammar. This representation can be categorized into phonetics, morphology, syntax, semantics, and discourses. These models are implemented as softwares which can process language artifacts, including utterance, sentences, text documents, etc.
We have developed several NLP tools, especially forĀ Indonesian Language, such as:
Indonesian Language is still considered as an Under-resourced Language, which means that we are still lack of language resources to support most of the natural language processing tools.
In our lab, we are also developing several language resources such as:
Text Mining seeks approaches for structuring textual data, deriving patterns from the structured textual, and finally interpreting the results as well as mining useful information from the results.
We have been doing research on the following areas of text mining:
Machine Translation is a sub-field of computational linguistics that seeks computational models to automatically translates text or speech expressed in one language to another language. Information Retrieval Lab has been publishing several works in this area, especially for Indonesia-English translation.