250 likes | 360 Views
FLAVIUS Meeting – WP4. June 7, 2010. Giurgiu Bogdan Wong William. Agenda. About Language Weaver R&D work Customer experience LW core mission LW architectural contribution Deliverables Roadmap Internal milestone rollup Questions & Answers. Language Weaver at a Glance.
E N D
FLAVIUS Meeting – WP4 June 7, 2010 Giurgiu Bogdan Wong William
Agenda • About Language Weaver • R&D work • Customer experience • LW core mission • LW architectural contribution • Deliverables • Roadmap • Internal milestone rollup • Questions & Answers
FLAVIUS Contributors • LW SRL, Romania • Daniel Marcu, CTO • Ionel Condor, Engineering Manager • Bogdan Giurgiu, Project Manager • Ana Totea, Engineer • Bogdan Faraga, Engineer • Daniel Sarbe, Engineer • Matei Nicolae, Engineer
R&D Projects • Research Projects • Improve syntax-based SMT (DARPA funded) • Small footprint systems for SMT • Domain customization techniques for SMT • R&D Projects • GALE Operational Engines – Broadcast Monitoring Solutions (speech2text translation) • FAUST – FP7 EC Project
Currently Available Language Pairs *Latest product release enables LW to translate to and from any language that is available with limited quality
Our Customer Deployments FACT INFLUENCE QUALITY Legal Marketing Publications Post Edit Trained Customer Care Digital Content Baseline Governments
Our Customer Experience FACT INFLUENCE QUALITY • Baselines are inadequate for FAUT (fully automated useful translation) • Lacks utility of translation (usefulness) • Basic translation(gisting) does not convey publisher needs such as terminology Legal Marketing Publications Post Edit Trained Customer Care Digital Content Baseline Governments
Customer Experience FACT INFLUENCE QUALITY Legal Marketing Publications Post Edit Trained Customer Care Digital Content Baseline • Human post edit for preservation of publisher voice • Humans productivity limited to 2.500 words per day • High cost prevents time critical high volume publication and user generated content Governments
Customer Experience FACT INFLUENCE QUALITY • Convergence of utility vs. ROI • Proven trust in actionable content over baseline engines • Significant cost reduction from influence oriented communications • Liberates publisher & user generated content Legal Marketing Publications Post Edit Trained Customer Care Digital Content Baseline Governments
Core Mission for FLAVIUS Accelerate the adoption of FAUT on a broad scale by leveraging easy customization of domain verticals for content publishers.
Keys to a Successful Partner Integration • Ability to integrate with Language Weaver Machine Translation for development and testing • Ability to customize baseline engines with dictionaries • Ability to customize baseline engines with training of domain/customer specific vertical system
Accomplishments To Date (M3) • No pre-financing is expected • Negotiated purchase agreement between LW SRL and Dell Computers • Purchased 14 Dell servers • Purchased Cisco network switch • Entered into collocation agreement between LW SRL and Latisys (hosting location in Irvine, CA) • All hardware delivered to Latisys • LW Inc. IT staff installed and deployed to TOD (Translations on Demand) at 0 cost to LW SRL. • Available languages: English to French, Spanish, Italian, German, Polish, Romanian, Swedish and vice-versa
Current Activities (M3) • LW setup integration partner accounts • Partner start development using TOD REST API: • HTTP base communication protocol • Web 2.0 used by Amazon, Twitter, etc. • Supported text formats: TXT, HTML, TMX, XLIFF
Upgrade TOD Framework (LW Milestone) • Internal milestone for LW to migrate partner accounts to upgraded TOD framework in month 9 • Provide new functionality outside of the FLAVIUS project but materially benefits the teams. • Extends current REST API • Trustscore™ enabled baseline engines • Utility not quality based assessment • Deployed for TripAdvisor and Dell • Reporting of basic statistics
Customization via Dictionary (M12) • REST API enabled dictionary support • Dictionary upload through API • A dictionary will be specific to an account, per language pair • i.e. Dell (account), Eng-Spa(LP), Servers Terminology (dictionary – 1+)
Customization via Training (M21) Parallel Aligned Text LW Training Compute Cloud d Optional: Regression Text Evaluation Product Delivery via TOD • Data: • Fix noisy text • More text • Text alignment • Text segmentation Optional: Test Text
Thank you! Accelerating the way the world communicates