百家乐怎么玩-澳门百家乐官网娱乐城网址_网上百家乐是不是真的_全讯网888 (中国)·官方网站

Words on the web: a new direction for semantics

 

A Website Management Programme for students in the Bachelor of Arts with Honours Degree in Language Information Science, offered by CityU's Department of Chinese,Translation and Linguistics, is proving a great success, with student participation now including a large number of students from all three years. The programme is run by Dr Jonathan Webster, Acting Head of CityU's Department of English and Communication. It is one of several projects in which Dr Webster is involved in affiliation with the Institute of Chinese Linguistics (ICL). "The programme has grown considerably from when there were maybe five or six students interested to the present, when almost the entire class is involved. It reflects a great change in student awareness of the importance of the web and IT, in terms of their future careers." 

The programme, which has been running for several years, provides an opportunity for students to gain hands-on experience in managing a web server. The initial focus was putting together a website that was directly related to topics in Chinese language and linguistics. Over the years this has expanded, as students have become involved in other projects, such as putting together a "grammar surgery" for the English Language Centre. It's a computer-assisted language-learning tool for students, with a fun slant of going to the "surgery" to visit the "grammar doctor".

Machine translation project

One of the more challenging projects currently being undertaken in affiliation with the ICL is the application of semantic web technology for an example-based machine translation project (EBMT). Other colleagues from the Department of Chinese, Translation and Linguistics participating in the project along with Dr Webster are Dr K K Sin , Dr H Pan and Mr Caesar Lun .

The EBMT project applies the "Example-based" approach to the translation of the specialized language of legislation and legal documents. The corpus being used with the project consists of bilingual law dictionaries and glossaries, the bilingual texts of Hong Kong legislation, conveyance documents and other legal materials such as judgments, court documents and contracts. The purpose of the project is to meet the growing demand for bilingual legal texts as Hong Kong's legal system converts from a monolingual to a bilingual legal system, while at the same time exploring the full potential of the example-based approach.

The initial task is to design a best-match algorithm for translated text spans ranging in size and scope from words to phrases, clauses and sentence patterns. The algorithm will be rigorously tested and human input of improved translations will be constantly incorporated into the corpus in order to build up and develop the learning ability of the algorithm. In turn, this will enhance the accuracy, consistency and intelligibility of the translated text.

Going through the phases

The EBMT project, which is funded by the University Grants Committee, has three phases: example acquisition, example application and example-base management.

The first phase, example acquisition, is nearly completed. This has involved the text alignment of the 25 million word Bilingual Laws Information System (BLIS) Corpora. The text alignment occurs at various linguistic levels, including word, phrase, clause and sentence. "The BLIS corpus was selected because it was an amazingly good rich text to work with and it was translated by experts. Progress has been quite good in this very difficult phase at the beginning, which is to get the examples by doing the alignment between English and Chinese," said Dr Webster.

Phase two, the example application phase, is currently in progress and deals with how existing examples are used to facilitate translation. The main issues include identification of useful examples in an input sentence, determination of a sequence of identified examples to be used in composing the translation, and further manipulation of the target language parts to render the composition. This is actually the translation process.

Within a year, the team hopes to have a prototype for example application where the examples are in a database and used to improve translation.

The third phase concerns the management of the example-base where the examples are stored in such a way as to facilitate subsequent retrieval. This method draws on advances in semantic web technology.

The semantic web approach provides the means for rendering information in a machine-processable form. "Basically, the web is now moving in a direction of how you can represent the meaning of the text, instead of just having a repository of documents. Instead it will have a rich knowledge base from which to draw information," Dr Webster explained.

Bilingual dictionary database

Another project Dr Webster is working on uses semantic web technology with a bilingual dictionary database. This will be useful for natural language processing as well as being a practical tool for any users of the web.

"Many databases today are very fixed - you input something and retrieve information following a fixed format. With this technology you will be able to store information and retrieve it using rules and inference. The only way you can do that is if your database is rich in terms of knowledge."

YOU MAY BE INTERESTED

Contact Information

Communications and Institutional Research Office

Back to top
亲朋棋牌官方下载| 百家乐官网博彩通博彩网皇冠网澳门赌场真人赌博 | 永利百家乐赌场娱乐网规则| 博彩论坛| 仕達屋百家乐官网的玩法技巧和规则| 全讯网博客| 百家乐官网玩法规| 大发888娱乐场下载 zhldu| 百家乐官网套利| 鹿邑县| 外围赌球软件| 百家乐官网赢钱公式论| 花垣县| 水果机下载| 做生意店铺风水好吗| 清原| 赌场百家乐官网的玩法技巧和规则 | 大发888song58| 百家乐投注网中国体育| 澳门百家乐官网自杀| 长方形百家乐筹码| 赌场百家乐官网欺诈方法| ceo娱乐城信誉| 澳门百家乐官网怎么玩| 百家乐官网自动投注| 南通棋牌游戏金游世界| 威尼斯人娱乐城赌场| 24山风水发几房| 太阳城网上投注| 百家乐有哪几种| 百家乐官网太阳城 | 百家乐的必胜方法| 百家乐官网机器手怎么做弊| 优博在线| 威尼斯人娱乐城平台打不开| 威尼斯人娱乐城 104| YY百家乐的玩法技巧和规则| 赌博百家乐有技巧吗| 澳门百家乐登陆网址| 81数理 做生意| 西游记百家乐官网娱乐城|