999精品在线视频,手机成人午夜在线视频,久久不卡国产精品无码,中日无码在线观看,成人av手机在线观看,日韩精品亚洲一区中文字幕,亚洲av无码人妻,四虎国产在线观看 ?

機器翻譯的風險

2017-05-02 22:43:30ByArthurGoldhammer
英語學習 2017年4期
關鍵詞:規則計算機人工智能

By+Arthur+Goldhammer

The ideal translator is a person “on whom nothing is lost,” said Henry James. Or maybe its a machine. But a machine wont stop you from swearing at nuns...

Years ago, on a flight from Amsterdam to Boston, two American nuns seated to my right listened to a voluble1 young Dutchman who was out to discover the United States. He asked the nuns where they were from. Alas, Framingham, Massachusetts was not on his itinerary, but, he noted, he had“shitloads of time and would be visiting shitloads of other places”.2

The jovial young Dutchman had apparently gathered that“shitloads” was a colourful synonym for the bland “lots”.3 He had mastered the syntax of English and a rather extensive vocabulary but lacked experience of the appropriateness of words to social contexts.4

This memory sprang to mind with the recent news that the Google Translate engine would move from a phrase-based system to a neural network. Both methods rely on training the machine with a “corpus”5 consisting of sentence pairs: an original and a translation. The computer then generates rules for inferring, based on the sequence6 of words in the original text, the most likely sequence of words from the target language.

The procedure is an exercise in pattern matching. Similar pattern-matching algorithms are used to interpret the syllables you utter when you ask your smartphone to “navigate to Brookline” or when a photo app tags your friends face.7 The machine doesnt “understand” faces or destinations; it reduces them to vectors8 of numbers, and processes them.

I am a professional translator, having translated some 125 books from the French. One might therefore expect me to bristle9 at Googles claim that its new translation engine is almost as good as a human translator, scoring 5.0 on a scale of 0 to 6, whereas humans average 5.1. But Im also a PhD in mathematics who has developed software that “reads” European newspapers in four languages and categorises the results by topic. So, rather than be defensive about the possibility of being replaced by a machine translator, I am aware of the remarkable feats of which machines are capable, and full of admiration for the technical complexity and virtuosity of Googles work.10

My admiration does not blind me to the shortcomings of machine translation, however. Think of the young Dutch traveler who knew “shitloads” of English. The young mans fluency demonstrated that his “wetware”—a living neural network, if you will—had been trained well enough to intuit the subtle rules (and exceptions) that make language natural.11 Computer languages, on the other hand, have context-free grammars. The young Dutchman, however, lacked the social experience with English to grasp the subtler rules that shape the native speakers diction, tone and structure. The native speaker might also choose to break those rules to achieve certain effects. If I were to say “shitloads of places”rather than “lots of places” to a pair of nuns, I would mean something by it. The Dutchman blundered into inadvertent comedy.12

Googles translation engine is “trained” on corpora ranging from news sources to Wikipedia. The bare description of each corpus is the only indication of the context from which it arises. From such scanty13 information it would be difficult to infer the appropriateness or inappropriateness of a word such as “shitloads”. If translating into French, the machine might predict a good match to beaucoup or plusieurs. This would render the meaning of the utterance but not the comedy,14 which depends on the socially marked“shitloads” in contrast to the neutral plusieurs. No matter how sophisticated the algorithm, it must rely on the information provided, and clues as to context, in particular social context, are devilishly15 hard to convey in code.

The problem, as with all previous attempts to create artificial intelligence (AI)16 going back to my student days at MIT, is that intelligence is incredibly complex. To be intelligent is not merely to be capable of inferring logically from rules or statistically from regularities. Before that, one has to know which rules are applicable, an art requiring awareness of sensitivity to situation. Programmers are very clever, but they are not yet clever enough to anticipate the vast variety of contexts from which meaning emerges. Hence even the best algorithms will miss things—and as Henry James put it, the ideal translator must be a person “on whom nothing is lost”.

This is not to say that mechanical translation is not useful. Much translation work is routine. At times, machines can do an adequate job. Dont expect miracles, however, or felicitous literary translations, or aptly rendered political zingers.17 Overconfident claims have dogged18 AI research from its earliest days. I dont say this out of fear for my job: Ive retired from translating and am devoting part of my time nowadays to…writing code.

亨利·詹姆斯說,理想的譯者應該是“一無所失”之人。或者,是一無所失之機器。但是,機器可不會教你不能在修女面前爆粗口。

幾年前,我從阿姆斯特丹乘機前往波士頓,兩位美國修女坐在我右邊,聽一個正要去探索美國的荷蘭小伙子侃侃而談。他問修女從哪兒來。啊,馬薩諸塞州的弗雷明漢,可惜不在他的行程計劃之內。但是他說,他有“賊他媽多的時間,可以去賊他媽多的其他地方”。

這個熱情友好的荷蘭小伙子顯然知道,“賊他媽多”跟普普通通的“很多”比起來,有趣得多。他掌握了英語的句法,有相當豐富的詞匯量,卻缺乏交際經驗,來判斷用詞是否合乎語境。

想起這件事,是因為有新聞說,谷歌翻譯引擎將從一個基于短語的系統,變成一個神經網絡系統。兩種方法都以語料庫為基礎,訓練計算機掌握多個由原文和譯文搭配組合的句子。計算機由此總結出一套規則,可以根據原句的詞語排列,推導出目標語言最有可能的詞語排序。

整個過程屬于模式匹配的訓練。當智能手機識別你的語音提問“導航到布魯克萊恩”,或者當拍照軟件識別你朋友的面部時,運用的也是類似的模式匹配算法。計算機并不能“理解”人臉或者目的地,而是把它們變成向量,再進行處理。

我是專業譯者,譯了差不多有125本法語書。有人因此可能會覺得,我看到谷歌的下述言論會很生氣:谷歌新的翻譯引擎跟人工譯者一樣好;若滿分6分,谷歌可以打到5分,而人類的平均水平也只有5.1分。但我同樣也是數學博士,我開發出來的軟件可以“閱讀”歐洲四種語言的報紙,再按主題將它們歸類。所以,我對機器翻譯取代人工翻譯并沒有多大戒心,反而非常清楚機器所取得的非凡成就,相當佩服谷歌復雜而精湛的技術。

佩服歸佩服,我也不會對機器翻譯的缺陷視而不見。想想那個會說“賊他媽多”的荷蘭年輕人,他流利的英語顯示他的“濕件”—— 一個活生生的神經網絡系統——已經訓練得足以感覺出一些細微規則(和例外),從而使語言自然流暢。相反,計算機語言則是純粹脫離語境的語法。然而,那位年輕的荷蘭人因缺乏英語社會經驗而無法掌握母語使用者在措辭、語氣和句子結構方面更微妙的規則。當然,母語使用者也可能有意打破這些規則,以達到某種效果。如果我對兩個修女說“賊他媽多地方”,而不是“很多地方”,我可能是話里有話。那個荷蘭人在誤打誤撞中造成了一種喜劇效果。

谷歌翻譯引擎所用的語料庫來自各種新聞資源和維基百科。對每個語料庫僅有的描述也就成了關于語境的唯一線索。從這少得可憐的信息當中,很難推斷像“賊他媽多”這樣的詞用著合不合適。如果譯成法語,機器可能會認為beaucoup或者plusiers都是很好的選擇。這些詞也許可以達意,但卻喪失了喜劇效果,而這種效果更依賴于帶有社會效應的“賊他媽多”一詞,而非中性的plusiers。不管算法有多復雜,它也得依賴于已有的信息和線索,至于語境,尤其是交際語境,則很難通過編碼來傳達。

人腦實在是太復雜了。我在麻省理工學院讀書時,這個問題就橫亙在創造人工智能的各種努力之前。要想和人類一樣智能,不僅僅是能夠根據規則進行邏輯推理,或是根據規律進行數據演算。在此之前,還得知道哪些規則是可用的,這得具有一種能敏銳覺察當時情況的藝術能力才行。程序員都很聰明,但是還沒有聰明到可以預估意義賴以產生的龐大語境。所以即使是最好的算法,也會有所缺失——所以正如亨利·詹姆斯所說,理想的譯者應該“一無所失”。

這并不是說機器翻譯毫無用處。很多翻譯工作都只是例行公事而已。有時,機器完全可以勝任。但可別指望多大的奇跡,比如貼切的文學翻譯,或者恰當的政治妙語。人工智能的研究從一開始就太過自信。我這么說并不是因為擔心失業:我已經不搞翻譯了,最近正抽空寫代碼呢。

1. voluble: 健談的。

2. itinerary: 旅行計劃,預定行程;shitload: 許多,大量。

3. jovial: 熱情友好的,天性快活的;synonym: 同義詞,近義詞;bland:平和的,溫和的。

4. syntax: 語法,句法;appropriateness:合適,得體。

5. corpus: 語料庫。

6. sequence: 順序,先后次序。

7. algorithm: 算法;syllable: 音節;navigate: 導航。

8. vector: 向量。

9. bristle: 顯得憤怒。

10. feat: 業績,功績;virtuosity: 精湛技巧。

11. wetware: 濕件,計算機專用術語,指軟件、硬件以外的其他“件”,即人腦、大腦神經系統;intuit: 憑直覺知道。

12. blunder: 跌跌撞撞,出漏子;inadvertent: 無意的,非故意的。

13. scanty: 不足的,勉強夠的。

14. render:(用不同的語言)表達,翻譯;utterance: 表達,表述。

15. devilishly: 非常,極其。

16. artificial intelligence (AI): 人工智能。

17. felicitous: 恰當的,貼切的;aptly: 適當地;zinger: 妙語,幽默的話。

18. dog: 作動詞,意為緊隨。

猜你喜歡
規則計算機人工智能
撐竿跳規則的制定
計算機操作系統
數獨的規則和演變
基于計算機自然語言處理的機器翻譯技術應用與簡介
科技傳播(2019年22期)2020-01-14 03:06:34
2019:人工智能
商界(2019年12期)2019-01-03 06:59:05
人工智能與就業
IT經理世界(2018年20期)2018-10-24 02:38:24
讓規則不規則
Coco薇(2017年11期)2018-01-03 20:59:57
信息系統審計中計算機審計的應用
消費導刊(2017年20期)2018-01-03 06:26:40
數讀人工智能
小康(2017年16期)2017-06-07 09:00:59
TPP反腐敗規則對我國的啟示
主站蜘蛛池模板: 青青国产视频| 99精品免费欧美成人小视频 | 国产男女免费完整版视频| 国产不卡国语在线| 国产欧美视频在线观看| 四虎影视国产精品| 亚洲成人在线免费观看| 午夜毛片免费观看视频 | 极品性荡少妇一区二区色欲| 国产精品毛片一区| 久久99国产乱子伦精品免| 亚洲国产精品VA在线看黑人| 国产精品视频免费网站| 亚洲欧美国产视频| 美女毛片在线| 亚洲欧美激情小说另类| 亚洲国产日韩一区| 亚洲αv毛片| AV色爱天堂网| 精品少妇人妻无码久久| 91口爆吞精国产对白第三集| 精品一區二區久久久久久久網站| 亚洲成人播放| 久久婷婷国产综合尤物精品| 国产亚洲精品自在线| 91免费国产高清观看| 午夜日本永久乱码免费播放片| 不卡午夜视频| 国产成人综合网在线观看| 色婷婷电影网| 日韩成人在线网站| 亚洲男人的天堂在线观看| 精品人妻AV区| 国产精品欧美激情| 午夜丁香婷婷| 91麻豆精品国产高清在线| 一本一道波多野结衣av黑人在线| 国产精品无码一二三视频| a天堂视频| 国产在线精品人成导航| 亚洲日韩精品无码专区97| 成人精品视频一区二区在线| 久热精品免费| 国产精品极品美女自在线| 国产精品短篇二区| 国产一级裸网站| 国产区91| 久久国产精品国产自线拍| 精品国产欧美精品v| 99精品国产高清一区二区| 青青极品在线| 波多野结衣国产精品| 国产第四页| 久久午夜夜伦鲁鲁片不卡| 亚洲青涩在线| 91在线丝袜| 久久性妇女精品免费| 久久国产精品电影| 香蕉eeww99国产在线观看| 国产精品视频久| 国产幂在线无码精品| 国产成人亚洲欧美激情| 国产精品无码影视久久久久久久| 国产99视频在线| 亚洲一道AV无码午夜福利| 无码丝袜人妻| 欧美三级日韩三级| 精品欧美视频| 国产91视频免费观看| 91精品国产自产91精品资源| 欧美丝袜高跟鞋一区二区| 欧美、日韩、国产综合一区| 中文国产成人精品久久| 中文字幕人成乱码熟女免费| 日本在线欧美在线| 欧美精品三级在线| 成人午夜天| 四虎永久免费网站| 伊人久久大香线蕉影院| 欧美狠狠干| 免费国产福利| 国产av无码日韩av无码网站|