百家乐怎么玩-澳门百家乐官网娱乐城网址_网上百家乐是不是真的_全讯网888 (中国)·官方网站

Words on the web: a new direction for semantics

 

A Website Management Programme for students in the Bachelor of Arts with Honours Degree in Language Information Science, offered by CityU's Department of Chinese,Translation and Linguistics, is proving a great success, with student participation now including a large number of students from all three years. The programme is run by Dr Jonathan Webster, Acting Head of CityU's Department of English and Communication. It is one of several projects in which Dr Webster is involved in affiliation with the Institute of Chinese Linguistics (ICL). "The programme has grown considerably from when there were maybe five or six students interested to the present, when almost the entire class is involved. It reflects a great change in student awareness of the importance of the web and IT, in terms of their future careers." 

The programme, which has been running for several years, provides an opportunity for students to gain hands-on experience in managing a web server. The initial focus was putting together a website that was directly related to topics in Chinese language and linguistics. Over the years this has expanded, as students have become involved in other projects, such as putting together a "grammar surgery" for the English Language Centre. It's a computer-assisted language-learning tool for students, with a fun slant of going to the "surgery" to visit the "grammar doctor".

Machine translation project

One of the more challenging projects currently being undertaken in affiliation with the ICL is the application of semantic web technology for an example-based machine translation project (EBMT). Other colleagues from the Department of Chinese, Translation and Linguistics participating in the project along with Dr Webster are Dr K K Sin , Dr H Pan and Mr Caesar Lun .

The EBMT project applies the "Example-based" approach to the translation of the specialized language of legislation and legal documents. The corpus being used with the project consists of bilingual law dictionaries and glossaries, the bilingual texts of Hong Kong legislation, conveyance documents and other legal materials such as judgments, court documents and contracts. The purpose of the project is to meet the growing demand for bilingual legal texts as Hong Kong's legal system converts from a monolingual to a bilingual legal system, while at the same time exploring the full potential of the example-based approach.

The initial task is to design a best-match algorithm for translated text spans ranging in size and scope from words to phrases, clauses and sentence patterns. The algorithm will be rigorously tested and human input of improved translations will be constantly incorporated into the corpus in order to build up and develop the learning ability of the algorithm. In turn, this will enhance the accuracy, consistency and intelligibility of the translated text.

Going through the phases

The EBMT project, which is funded by the University Grants Committee, has three phases: example acquisition, example application and example-base management.

The first phase, example acquisition, is nearly completed. This has involved the text alignment of the 25 million word Bilingual Laws Information System (BLIS) Corpora. The text alignment occurs at various linguistic levels, including word, phrase, clause and sentence. "The BLIS corpus was selected because it was an amazingly good rich text to work with and it was translated by experts. Progress has been quite good in this very difficult phase at the beginning, which is to get the examples by doing the alignment between English and Chinese," said Dr Webster.

Phase two, the example application phase, is currently in progress and deals with how existing examples are used to facilitate translation. The main issues include identification of useful examples in an input sentence, determination of a sequence of identified examples to be used in composing the translation, and further manipulation of the target language parts to render the composition. This is actually the translation process.

Within a year, the team hopes to have a prototype for example application where the examples are in a database and used to improve translation.

The third phase concerns the management of the example-base where the examples are stored in such a way as to facilitate subsequent retrieval. This method draws on advances in semantic web technology.

The semantic web approach provides the means for rendering information in a machine-processable form. "Basically, the web is now moving in a direction of how you can represent the meaning of the text, instead of just having a repository of documents. Instead it will have a rich knowledge base from which to draw information," Dr Webster explained.

Bilingual dictionary database

Another project Dr Webster is working on uses semantic web technology with a bilingual dictionary database. This will be useful for natural language processing as well as being a practical tool for any users of the web.

"Many databases today are very fixed - you input something and retrieve information following a fixed format. With this technology you will be able to store information and retrieve it using rules and inference. The only way you can do that is if your database is rich in terms of knowledge."

YOU MAY BE INTERESTED

Contact Information

Communications and Institutional Research Office

Back to top
博彩网站排名| 大发888娱乐场下载制度| 永利高百家乐官网进不去| 大发888官方6| 真人百家乐官网视频| 百家乐桌码合| 大发888bet游戏平台| 马牌百家乐官网娱乐城| 网络百家乐证据| 丹阳市| 关于阳宅风水24山知识| 和记国际网上娱乐| 博狗官网| 百家乐反缆公式| 百家乐官网追注法| 申博太阳城娱乐网| 百家乐发牌的介绍| 郁南县| 百家乐巴厘岛娱乐城| 百家乐官网必胜法hk| 万人迷百家乐的玩法技巧和规则 | 百家乐丽| 百家乐庄闲局部失衡| 百家乐官网波音平台有假吗| 百家乐六亿财富| 百家乐官网3宜3忌| 好用百家乐软件| 百家乐官网隔一数打投注法| 火箭百家乐的玩法技巧和规则| 至尊百家乐官网赌场娱乐网规则| 足球现金网开户| 百家乐庄家必赢诀窍| 百家乐官网博百家乐官网| 现金百家乐官网网上娱乐| 娱乐城简介| sz新全讯网网址112| 百家乐平注赢钱法| 网上百家乐官网群的微博| 顶尖百家乐官网开户| bet365最新网址| 大发888dafabet|