This paper discourses our CLIR experiments performed for the FIRE1 workshop. We had submitted our runs for Adhoc monolingual document retrieval in Hindi and English, and Ad-hoc cross-lingual document retrieval from Hindi to English, and English to Hindi. In this paper, we describe our English to Hindi and Hindi to English CLIR systems and the experiments conducted on them using the FIRE- 2008 dataset.
Telugu is the official language of Andhra Pradesh state and one of the widely spoken languages in the world. However, there is no standard input method, which has a widespread use among Telugu users on computers. In this paper, we describe the design of Telugu soft keyboards, which are based on a set of design principles. We also evaluate these designs along with existing designs and compare their performances.
In this paper we have proposed an approach for automatic language and subject identification for the books of digital library. The important characteristics of function words is explored for language identification. The heuristic search approach is explored for subject identification by matching title words with the keywords of the subjects. The language identification system is developed for five languages namely English, French, German, Italian and Spanish.
In this paper, we describe a sentence position based summarizer that is built based on a sentence position policy, created from the evaluation testbed of recent summarization tasks at Document Understanding Conferences (DUC).We show that the summarizer thus built is able to outperform most systems participating in task focused summarization evaluations at Text Analysis Conferences (TAC) 2008.
Retrieving answer containing passages is a challenging task in Question Answering. In this paper we describe a novel query expansion method which aims to rank the answer containing passages better. It uses content and structured information (link structure and category information) of Wikipedia to generate a set of terms semantically related to the question.