370 likes | 393 Views
Kingdom of Saudi Arabia Ministry of Education Umm Al- Qura University Adham University College Computer Science Department. المملكة العربية السعودية وزارة التعليم جامعة أم القرى الكلية الجامعية بأضم قسم الحاسب الآلي. Computer Arabization Systems, 6803530-3. T.Mariah KHayat.
E N D
Kingdom of Saudi Arabia Ministry of Education Umm Al-Qura University Adham University College Computer Science Department المملكة العربية السعودية وزارة التعليم جامعة أم القرى الكلية الجامعية بأضم قسم الحاسب الآلي Computer Arabization Systems, 6803530-3 T.Mariah KHayat
Kingdom of Saudi Arabia Ministry of Education Umm Al-Qura University Adham University College Computer Science Department المملكة العربية السعودية وزارة التعليم جامعة أم القرى الكلية الجامعية بأضم قسم الحاسب الآلي Lecture FiveThe Language Computing and Machine Translation T.Mariah KHayat
Kingdom of Saudi Arabia المملكة العربية السعودية Ministry of Education وزارة التعليم Umm AlQura University جامعة أم القرى Adham University College الكلية الجامعية بأضم Computer Science Department قسم الحاسب الآلي REFERENCES: This slide is the translation taken from the slide: [1]Arabization of Computers, by Dr.Husni Al-Muhtaseb. Computer Arabization Systems, 6803530-3, T.Mariah Khayat
Kingdom of Saudi Arabia المملكة العربية السعودية Ministry of Education وزارة التعليم Umm AlQura University جامعة أم القرى Adham University College الكلية الجامعية بأضم Computer Science Department قسم الحاسب الآلي Content: 1.The Language Computing Term. 2.The Applications. 3.The Natural Language Processing. 4.Machine Translation. Computer Arabization Systems, 6803530-3, T.Mariah Khayat
Kingdom of Saudi Arabia المملكة العربية السعودية Ministry of Education وزارة التعليم Umm AlQura University جامعة أم القرى Adham University College الكلية الجامعية بأضم Computer Science Department قسم الحاسب الآلي The Language Computing –The Term: • اللسانيات الحسابية Computational Linguistics • الحوسبة العربية Arabic Computing • The term revolves around how to use the Arabic language processing by the computer • معالجة اللغة العربية بالحاسب الآلي • Natural Arabic Processing Computer Arabization Systems, 6803530-3, T.Mariah Khayat
Kingdom of Saudi Arabia المملكة العربية السعودية Ministry of Education وزارة التعليم Umm AlQura University جامعة أم القرى Adham University College الكلية الجامعية بأضم Computer Science Department قسم الحاسب الآلي Examples of Arabic Language Processing Applications by Computer: • Smart Computing Systems. • User interfaces in natural language for databases. • Computer Assisted Education. • Search in the Internet and retrieve data. • Automatic translation from and to Arabic. • Answer questions in natural language. Computer Arabization Systems, 6803530-3, T.Mariah Khayat
Kingdom of Saudi Arabia المملكة العربية السعودية Ministry of Education وزارة التعليم Umm AlQura University جامعة أم القرى Adham University College الكلية الجامعية بأضم Computer Science Department قسم الحاسب الآلي Examples of Arabic Language Processing Applications by Computer: • Spell Checking programs. • Grammar Checking programs. • Word root retrieval programs. • Sound pronunciation and recognition by computer. • Converts the written Arabic text to Arabic speech. • Convert Arabic speech to written Arabic text. • Optical Character recognition. Computer Arabization Systems, 6803530-3, T.Mariah Khayat
Kingdom of Saudi Arabia المملكة العربية السعودية Ministry of Education وزارة التعليم Umm AlQura University جامعة أم القرى Adham University College الكلية الجامعية بأضم Computer Science Department قسم الحاسب الآلي Natural Languages Processing: Definition It’s an artificial intelligence discipline, means building programs that have the potential to "understand" a natural language to reach a specific goal. NLP Converting from a representation (input text) to another (internal representation). Understand Computer Arabization Systems, 6803530-3, T.Mariah Khayat
Kingdom of Saudi Arabia المملكة العربية السعودية Ministry of Education وزارة التعليم Umm AlQura University جامعة أم القرى Adham University College الكلية الجامعية بأضم Computer Science Department قسم الحاسب الآلي English Language Example –Talk with Alice-: • http://www.pandorabots.com/pandora/talk?botid=f5d922d97e345aa1&skin=custom_input Computer Arabization Systems, 6803530-3, T.Mariah Khayat
Kingdom of Saudi Arabia المملكة العربية السعودية Ministry of Education وزارة التعليم Umm AlQura University جامعة أم القرى Adham University College الكلية الجامعية بأضم Computer Science Department قسم الحاسب الآلي Natural Languages Processing: Computer Arabization Systems, 6803530-3, T.Mariah Khayat
Kingdom of Saudi Arabia المملكة العربية السعودية Ministry of Education وزارة التعليم Umm AlQura University جامعة أم القرى Adham University College الكلية الجامعية بأضم Computer Science Department قسم الحاسب الآلي Natural Languages Understanding Phases: Input Morphological analysis Analyze each word into its parts Semantic Analysis Converts inputs into an internal representation that retains the meaning Discourse: -Analysis of intent-: Interpretation of what has been said as intended Syntax Analysis Converts the sentence (strings of words) into specific structures showing the interrelatedness of words Pragmatics Analysis Understanding and referral of links and pronouns between sentences Computer Arabization Systems, 6803530-3, T.Mariah Khayat
Kingdom of Saudi Arabia المملكة العربية السعودية Ministry of Education وزارة التعليم Umm AlQura University جامعة أم القرى Adham University College الكلية الجامعية بأضم Computer Science Department قسم الحاسب الآلي Natural Languages Understanding Phases: • 1. Morphology: • How do you build words from smaller components? Knowledge of word root and weight, prefixes and suffixes. • 2. Syntax: • How do words relate to each other to be a correct sentence and what is the role of each word in the syntax? • 3. Semantic: • The meanings of words and how these meanings are sharing in the meaning of the sentence. Computer Arabization Systems, 6803530-3, T.Mariah Khayat
Kingdom of Saudi Arabia المملكة العربية السعودية Ministry of Education وزارة التعليم Umm AlQura University جامعة أم القرى Adham University College الكلية الجامعية بأضم Computer Science Department قسم الحاسب الآلي Natural Languages Understanding Phases: • 4. Pragmatics: • How to use sentences in different positions and what is the meaning (intend) of the sentence? • 5. Discourse: • What is the immediate effect of the preceding sentences on the understanding of the following sentence? Computer Arabization Systems, 6803530-3, T.Mariah Khayat
Kingdom of Saudi Arabia المملكة العربية السعودية Ministry of Education وزارة التعليم Umm AlQura University جامعة أم القرى Adham University College الكلية الجامعية بأضم Computer Science Department قسم الحاسب الآلي Dictionary: • Dictionary: is a vocabulary bank that includes language words and its linguistic characteristics. • There are many free electronic dictionaries in languages other than Arabic. • WordNet, a database of English vocabulary. • Do we build an Arabic dictionary? Computer Arabization Systems, 6803530-3, T.Mariah Khayat
Kingdom of Saudi Arabia المملكة العربية السعودية Ministry of Education وزارة التعليم Umm AlQura University جامعة أم القرى Adham University College الكلية الجامعية بأضم Computer Science Department قسم الحاسب الآلي Natural Languages Understanding Phases: Dictionary Input Morphological analysis Analyze each word into its parts Semantic Analysis Converts inputs into an internal representation that retains the meaning Analysis of intent Interpretation of what has been said as intended Syntax Analysis Converts the sentence (strings of words) into specific structures showing the interrelatedness of words Speech Analysis Understanding and referral of links and pronouns between sentences Computer Arabization Systems, 6803530-3, T.Mariah Khayat
Kingdom of Saudi Arabia المملكة العربية السعودية Ministry of Education وزارة التعليم Umm AlQura University جامعة أم القرى Adham University College الكلية الجامعية بأضم Computer Science Department قسم الحاسب الآلي Machine Translation: • Translating text from a specific language into another language using advanced information technologies, including computers, programs and methods of artificial intelligence. • The goal of machine translation is to obtain the translated text in high quality and in a complete automation without human intervention. Computer Arabization Systems, 6803530-3, T.Mariah Khayat
Kingdom of Saudi Arabia المملكة العربية السعودية Ministry of Education وزارة التعليم Umm AlQura University جامعة أم القرى Adham University College الكلية الجامعية بأضم Computer Science Department قسم الحاسب الآلي Machine Translation Example: Computer Arabization Systems, 6803530-3, T.Mariah Khayat
Kingdom of Saudi Arabia المملكة العربية السعودية Ministry of Education وزارة التعليم Umm AlQura University جامعة أم القرى Adham University College الكلية الجامعية بأضم Computer Science Department قسم الحاسب الآلي Machine Translation Example: Computer Arabization Systems, 6803530-3, T.Mariah Khayat
Kingdom of Saudi Arabia المملكة العربية السعودية Ministry of Education وزارة التعليم Umm AlQura University جامعة أم القرى Adham University College الكلية الجامعية بأضم Computer Science Department قسم الحاسب الآلي The first example and visualization of the machine translation system was in the middle of the 17th century Computer Arabization Systems, 6803530-3, T.Mariah Khayat
Kingdom of Saudi Arabia المملكة العربية السعودية Ministry of Education وزارة التعليم Umm AlQura University جامعة أم القرى Adham University College الكلية الجامعية بأضم Computer Science Department قسم الحاسب الآلي History: • At the beginning of the seventeenth century someone proposed the formation of mechanical dictionaries. • The first example and dream of the mechanical translation system was published in the mid-17th century. • In 1933, two independent patents were registered in France and Russia for the idea of mechanical translation. • The American scientist of French origin, Georges Artzoni, designed a solid tool on a reinforced paper tape used to find an equivalent and similar to any word in the other language. The initial model appeared in 1937 • The use of computers for translation service was discussed by British scientists in 1949 using the method of punch cards to produce the raw words to translate scientific abstracts. Computer Arabization Systems, 6803530-3, T.Mariah Khayat
Kingdom of Saudi Arabia المملكة العربية السعودية Ministry of Education وزارة التعليم Umm AlQura University جامعة أم القرى Adham University College الكلية الجامعية بأضم Computer Science Department قسم الحاسب الآلي History: • World wars have shown the need for immediate machine translation. The first conference on automatic translation was held in 1952. • The first automatic translation project at Georgetown University in cooperation with IBM 1954. • The project was experimented with selected samples from 49 Russian sentences to be translated into English; and with limited vocabulary consisting of 250 words and 6 rules only. • The US Air Force collaborated with IBM to build an integrated system for automatic translation between French and English. • In 1966, the Advisory Committee for Automated Processing of Living Languages in America concluded that machine translation is very slow, less accurate and twice as expensive as human translation and therefore there is no useful or expected benefit from machine translation. Computer Arabization Systems, 6803530-3, T.Mariah Khayat
Kingdom of Saudi Arabia المملكة العربية السعودية Ministry of Education وزارة التعليم Umm AlQura University جامعة أم القرى Adham University College الكلية الجامعية بأضم Computer Science Department قسم الحاسب الآلي History: • Canada uses both English and French, making automatic translation very important. • The European Economic Union requires the translation of scientific and technical documents between European languages. • Between 1970 and 1980, several automated translation systems emerged, mostly based on Georgetown University's experience in this field. • Of these systems, which are translating automatically between languages, English, French and English, Italian and English, German and others. Computer Arabization Systems, 6803530-3, T.Mariah Khayat
Kingdom of Saudi Arabia المملكة العربية السعودية Ministry of Education وزارة التعليم Umm AlQura University جامعة أم القرى Adham University College الكلية الجامعية بأضم Computer Science Department قسم الحاسب الآلي History: • In the early 1980s, the technique and science of artificial intelligence emerged, raising the level of automatic translation and putting it in a new direction. • First appeared in an American university, and used in the early translation projects a Dutch company (Philips). • Its use has grown exponentially in Japanese computer companies and has led to the emergence of the Japanese machine translation system between Japanese and English by Kyoto University. Computer Arabization Systems, 6803530-3, T.Mariah Khayat
Kingdom of Saudi Arabia المملكة العربية السعودية Ministry of Education وزارة التعليم Umm AlQura University جامعة أم القرى Adham University College الكلية الجامعية بأضم Computer Science Department قسم الحاسب الآلي What about Arabic? • Limited companies have emerged in this field, like Sakhr. • The migration of Arab minds to companies and foreign countries and working with them. • The interest in Arabic has increased after the recent international events. Computer Arabization Systems, 6803530-3, T.Mariah Khayat
Kingdom of Saudi Arabia المملكة العربية السعودية Ministry of Education وزارة التعليم Umm AlQura University جامعة أم القرى Adham University College الكلية الجامعية بأضم Computer Science Department قسم الحاسب الآلي The Translation stages: • 1. Human Translation: • The Translation is done completely by a human translator. • 2. Translator Help: • The translation is done by a human translator by a help of a machine translation with what can be called machine dictionary. • 3. Editing before translation: • The machine translation is done for simple syntax sentences with specific meanings, and when the exit of some sentences from the frame of the system of machine translation, the user is required to restructure those sentences. Computer Arabization Systems, 6803530-3, T.Mariah Khayat
Kingdom of Saudi Arabia المملكة العربية السعودية Ministry of Education وزارة التعليم Umm AlQura University جامعة أم القرى Adham University College الكلية الجامعية بأضم Computer Science Department قسم الحاسب الآلي The Translation stages: • 4. Editing after translation: • The sentences produced by the machine translation system need to be corrected and rewritten. • 5. Machine translation: • The machine translation system translates the text from English to Arabic without assistance, and this is the ultimate goal of automatic or machine translation. Computer Arabization Systems, 6803530-3, T.Mariah Khayat
Kingdom of Saudi Arabia المملكة العربية السعودية Ministry of Education وزارة التعليم Umm AlQura University جامعة أم القرى Adham University College الكلية الجامعية بأضم Computer Science Department قسم الحاسب الآلي Is it a Dream? • To put an English book about a specific subject, in specific machine, and after a period of time (hours), getting the Arabic translation of the book. • Convert images to characters and store them in computer memory. Computer Arabization Systems, 6803530-3, T.Mariah Khayat
Kingdom of Saudi Arabia المملكة العربية السعودية Ministry of Education وزارة التعليم Umm AlQura University جامعة أم القرى Adham University College الكلية الجامعية بأضم Computer Science Department قسم الحاسب الآلي Features of Automatic Translation from English to Arabic? Computer Arabization Systems, 6803530-3, T.Mariah Khayat
Kingdom of Saudi Arabia المملكة العربية السعودية Ministry of Education وزارة التعليم Umm AlQura University جامعة أم القرى Adham University College الكلية الجامعية بأضم Computer Science Department قسم الحاسب الآلي Features of Automatic Translation from English to Arabic? • Morphological Analysis: • The morphological analysis unit defines vocabulary of words in terms of name, verb, character, and appendices (prefixes, suffixes, and implications) on each. • Syntax Analysis: • This unit determines the location of the vocabulary of words in terms of being a subject, object, adjective, or adverb, etc., that’s for the name. • But for the verb, the verb tense, inflection, etc. Computer Arabization Systems, 6803530-3, T.Mariah Khayat
Kingdom of Saudi Arabia المملكة العربية السعودية Ministry of Education وزارة التعليم Umm AlQura University جامعة أم القرى Adham University College الكلية الجامعية بأضم Computer Science Department قسم الحاسب الآلي Features of Automatic Translation from English to Arabic? • Semantic Analysis: • A representation of the semantics of text words and their association is done by using knowledge bases that contain the semantic representation of English vocabulary. • The Transport Algorithm: • It transmits semantic representation in English to semantic representation in Arabic. A special bilingual dictionary and Arabic and English knowledge bases defined in the system are used to obtain a representation of meanings in Arabic. Computer Arabization Systems, 6803530-3, T.Mariah Khayat
Kingdom of Saudi Arabia المملكة العربية السعودية Ministry of Education وزارة التعليم Umm AlQura University جامعة أم القرى Adham University College الكلية الجامعية بأضم Computer Science Department قسم الحاسب الآلي Features of Automatic Translation from English to Arabic? • Building Sentences: • After getting a representation of meanings in Arabic is the process of creating sentences. • Bilingual Dictionary: • The design of the bilingual dictionary requires special attention to its construction and identification of its input elements. It is essential that the lexicon be designed in such a way that it accepts the data and then automatically stores it in the appropriate form and structures. There is no doubt that the flexibility of using the lexicon and the method of data storage and retrieval means has a great impact in evaluating the system, especially as we expect a large dictionary. Previous experiences in this area could be used Computer Arabization Systems, 6803530-3, T.Mariah Khayat
Kingdom of Saudi Arabia المملكة العربية السعودية Ministry of Education وزارة التعليم Umm AlQura University جامعة أم القرى Adham University College الكلية الجامعية بأضم Computer Science Department قسم الحاسب الآلي Features of Automatic Translation from English to Arabic? • Artificial Intelligence and Expert Systems: • Tools and methods of artificial intelligence are used in the design and construction of the translation system, and then the tools of expert systems can be used. • We recommend that you add a review module after the text has been translated to eliminate the ambiguity that may sometimes appear for one reason or another. Computer Arabization Systems, 6803530-3, T.Mariah Khayat
Kingdom of Saudi Arabia المملكة العربية السعودية Ministry of Education وزارة التعليم Umm AlQura University جامعة أم القرى Adham University College الكلية الجامعية بأضم Computer Science Department قسم الحاسب الآلي Translation Site: • Google Traslate Site: • https://translate.google.com/ Computer Arabization Systems, 6803530-3, T.Mariah Khayat
Kingdom of Saudi Arabia المملكة العربية السعودية Ministry of Education وزارة التعليم Umm AlQura University جامعة أم القرى Adham University College الكلية الجامعية بأضم Computer Science Department قسم الحاسب الآلي Translation Site: • https://www.online-translator.com/ Computer Arabization Systems, 6803530-3, T.Mariah Khayat
Kingdom of Saudi Arabia المملكة العربية السعودية Ministry of Education وزارة التعليم Umm AlQura University جامعة أم القرى Adham University College الكلية الجامعية بأضم Computer Science Department قسم الحاسب الآلي Translation Application: • iTranslate: • https://itunes.apple.com/us/app/itranslate-translator/id288113403?mt=8 Computer Arabization Systems, 6803530-3, T.Mariah Khayat
Kingdom of Saudi Arabia المملكة العربية السعودية Ministry of Education وزارة التعليم Umm AlQura University جامعة أم القرى Adham University College الكلية الجامعية بأضم Computer Science Department قسم الحاسب الآلي Translation Program: • Oxford: Computer Arabization Systems, 6803530-3, T.Mariah Khayat
Kingdom of Saudi Arabia المملكة العربية السعودية Ministry of Education وزارة التعليم Umm AlQura University جامعة أم القرى Adham University College الكلية الجامعية بأضم Computer Science Department قسم الحاسب الآلي The End Summary of Lecture FiveT.Mariah Khayatالأستاذة/ مارية خياطAdham University Collegeالكلية الجامعية بأضمmskhayat@uqu.edu.samariahkhuqu@gmail.com Computer Arabization Systems, 6803530-3, T.Mariah Khayat