This web site is designed for accessibility. Content is obtainable and functional to any browser or Internet device. This page's full visual experience is available in a graphical browser that supports web standards. See reasons to upgrade your browser.

CRBLP

Homepage

Research

Publications

Technical Reports

Demos-Downloads

People

Internship

Student Projects

Events

Seminar

Links

Contact Information

Center for Research on Bangla Language Processing
BRAC University
66, Mohakhali, Dhaka-1212
Phone: +88 (02) 8824051-4 Ext:4023
Fax: +88 (02) 8810383
crblp@bracu.ac.bd

height=1 src="2 Column Demo_files/180px.gif" width=180 border=0>

::--ACTIVE RESEARCH PROJECTS--::

::DOCUMENT IMAGE PROCESSING::

Optical Character Recognition:
Optical Character Recognition (OCR) is a process to convert printed text Image to editable text that can be used and reformatted for other uses. We are developing a Bangla Character Recognizer that can recognize printed Bangla document and convert to editable text.
Project Page: http://www.bracu.ac.bd/research/crblp/ocr

::SPEECH PROCESSING::

Imagine a day when you wake up, get washed and order your Personal Assistant (who most unfortunately is a machine) to make coffee and to read out the headlines from the newspaper. Now, there are two basic hurdles we need to overcome. First, how do we make a computer understand what we are saying. And second, how do we make the computer say the things we want it to say. To solve the first problem we need speech recognition technology and speech synthesis for the second.

Emotion Recognition from speech:
This projects aims to conduct research on emotion recognition from speech of Bangla native speaker. Right now we are targeting to recognize four emotional states: unemotional (normal), anger, happiness and sadness.
Project page: http://www.bracu.ac.bd/research/crblp/emr


Speech Synthesis:

Speech Synthesis is the artificial production of human speech. A Text-to-speech system converts normal text to speech. We are working on generating speech signal from Bangla Text.
Project page: http://crblp.bracu.ac.bd/demo/tts

Speech Corpus:
A speech corpus (or spoken corpus) is a database of speech audio files and text transcriptions in a format that can be used to create acoustic models of speech recognition and unit selection speech synthesis.This speech corpus contains read speech and continuous speech.
Project page: http://crblp.bracu.ac.bd/demo/speech_corpus/

::COMPUTATIONAL LINGUISTICS::

Corpus Analysis & Corpus Collection:
We have developed a tool for extensive corpus analysis on word frequency distribution. We currently have a corpus of one-year of Prothom-Alo newspaper text and Charjapad and Baru Chandi Das Er Kabbo. We have analyzed our corpus for regularities and anomalies in Bangla Word Usage.
Project Page: http://www.bracu.ac.bd/research/crblp/corpus

Localized URL:
As world researchers thinking and developing standard on localized domain name so why don’t we think about it? CRBLP team is working this project to develop a standard for domain name in Bangla.
Project Page: http://www.bracu.ac.bd/research/crblp/localURL

Lexicon:
We need a rich and informative lexicon for any kind Bangla Language Processing. We have developed a wordlist of 160 thousand words with 1st step parts of speech tagging.
Project Page: http://www.bracu.ac.bd/research/crblp/lexicon

Wordnet:
We need a rich and informative lexicon for any kind Bangla Language Processing. We have developed a wordlist of 160 thousand words with 1st step parts of speech tagging.
Project Page: http://www.bracu.ac.bd/research/crblp/wordnet

Parallel Corpus:
The aim of this project is to make the multi-lingual Bangla and English parallel corpus in electronic and aligned form. The parallel corpus can be used for translation memory, cross-language information retrieval, multi-lingual lexicography, terminology extraction, language study and language learning tool.
Project Page:

 

Center for Research on Bangla Language Processing
BRAC University, Dhaka, Bangladesh
© All Rights Reserved 2008