国产av不卡一区二区_欧美xxxx做受欧美_成年人看的毛片_亚洲第一天堂在线观看_亚洲午夜精品久久久中文影院av_8x8ⅹ国产精品一区二区二区_久久精品国产sm调教网站演员_亚洲av综合色区无码一二三区_成人免费激情视频_国产九九九视频

Global EditionASIA 中文雙語Fran?ais
Business
Home / Business / Technology

AI's global village opens wider to more voices

Developers look to break from yoke of English language, cater to all groups of people

By Oasis Hu in Hong Kong | China Daily | Updated: 2024-12-06 07:14
Share
Share - WeChat
LU PING/CHINA DAILY

Artificial intelligence engineer Jacky Chan Ho-kit has conflicting feelings about his industry.

While he looks forward to a future where AI reaches its pinnacle — possessing humanlike cognitive capabilities — he is deeply concerned that it will only understand English.

"Given the language status quo, this is highly likely to be a reality rather than just alarmism," he said.

Chan is the chief technology officer at Votee, a Hong Kong-based AI company. He is also a language enthusiast who in his free time follows language bloggers on social media, absorbing their linguistic insights. Through his research, he has learned that many languages are disappearing.

Even though there are around 7,000 languages still in use globally, according to the World Atlas of Languages of UNESCO, only 10 boast more than 200 million speakers. UNESCO has said that a language vanishes every two weeks, with 25 disappearing annually.

In the online realm, the disparity in language usage rates is even more pronounced.

Over the last decade, English content has dominated the internet, accounting for 49.4 percent as of Nov 26 — more than eight times the use of Spanish, the second most prevalent online language at 6 percent, according to a report by W3Techs, a company that conducts global web surveys.

Conversely, the proportion of web pages that use Chinese, the second-most spoken language in the physical world with more than 1.1 billion speakers, has plummeted from 4.3 percent in 2013 to 1.2 percent in 2024.

In the realm of AI, prominent large language models, or LLMs, like Open-AI's ChatGPT4, Google's Gemini, and Anthropic's Claude all use English as their main language.

Mainstream AI language models, particularly those originating in the West, are made for English-speaking audiences, with translations for other languages serving as only a support function, said Cao Jiannong, chair professor in the Department of Computing at Hong Kong Polytechnic University.

Artificial intelligence is a field devoted to developing technologies that can replicate or even surpass human intelligence. Before this vision becomes real, large-scale AI companies will continue to prioritize enhancing AI's intelligence ability, instead of expanding their services to encompass more languages, Cao added.

Chan, CTO at Votee, agreed that the endgame of AI is humanlike intelligence, but questions the consequences if such intelligence can only speak English.

"Wouldn't it be even more unfair to non-English speakers? Wouldn't global cultural diversity be greatly eroded? Wouldn't the gap between the world's rich and poor be wider?" Chan said.

Since last year, Votee, which previously concentrated on automated data collection and analysis, has shifted its focus to developing AI services for lesser-used languages.

This year, it unveiled a Cantonese LLM and is actively pursuing clients in Southeast Asia, Africa, and the Chinese mainland. Future initiatives include the launch of LLMs and other AI services for Javanese in Indonesia, Okinawan in the southern region of Japan, and various Chinese dialects including Shanghainese and Hakka.

"In an increasingly polarized world, we aim to utilize technology to bridge this gap," Chan said.

Data scarcity

The cornerstone of training AI lies in data. A significant hurdle in advancing AI's linguistic prowess is the scarcity of data available in numerous languages, Chan said.

Of about 7,000 languages spoken worldwide, nearly 99 percent are considered low-resource languages, as the data available for computational processing and analysis is limited.

The fact that mainstream AI tools predominantly rely on English corpora, or collection of written text, leads to significant inconvenience when handling other languages, said Ting Paksun, CEO of Votee.

These AI tools often result in inaccuracies and biased content, cultural misunderstandings, business errors, and even legal violations, rendering them unsuitable for use in both casual and formal contexts, Ting said.

On the beneficial side, AI tools hold the potential to streamline operations, boost productivity, and have a direct impact on local economies.

At an investment summit in mid-November in Hong Kong, Daniel Pinto, president of JPMorgan Chase, said that AI contributed approximately $1.3 billion to the group's finances last year, through cost reductions or revenue increases, with projections indicating a rise to $2 billion this year.

Chan warned regions that are unable to leverage AI tools due to language limitations are likely to experience decreased productivity in the future.

To avoid lagging behind European and United States tech giants, governments and major tech firms in some regions have initiated the development of LLMs customized to their linguistic needs, Cao from the Hong Kong Polytechnic University said.

The UAE, for instance, introduced Jais, the highest-quality Arabic AI LLM, in 2023. This year, South Korea's LG Group unveiled Exaone 3, the country's inaugural open-source Korean AI model.

Smaller, nimbler

Many smaller companies around the world are also venturing into the creation of small language models, Cao said.

Asiabots Ltd, a Hong Kong-based artificial intelligence company established in 2017, is one such company.

Chris Shum Chiu-fai, co-founder and CEO of Asiabots, said that the company initially prioritized AI capabilities in Cantonese due to its Hong Kong location. However, over time an increasing number of clients have approached them for AI solutions in various languages.

Their clients encompass government bodies and private enterprises worldwide including from Southeast Asia and Europe. Instead of opting for large language models, they prefer small language models tailored to specific scenarios, such as AI-driven customer service, AI speech recognition technology, and AI text-to-speech tools.

Asiabots' clients include the Hong Kong Special Administrative Region government, which asked them to develop AI tools for translation services between Cantonese and Middle Eastern languages. The request followed this year's Policy Address, which called for attracting more Muslim tourists, and encouraged the city's taxi services to offer information in Arabic for visitors from the Middle East.

In July, a tourism company in Kunigami, Okinawa, Japan, engaged Asiabots to develop an AI tool capable of translating multiple languages, including minor ones such as Vietnamese.

"Japan is preparing to host the World Expo next year. With the anticipated increase in global tourism, many Japanese companies are seeking AI tools, leading to a surge in requests from Japan recently," Shum said.

Specialized needs

Many mainstream AI tools excel at translating between widely spoken languages such as English and Chinese. However, when faced with less common languages, these tools may falter in recognizing speech and converting it into text, resulting in numerous errors.

The primary issue lies in inadequate data for the specific language, Shum said.

In some instances, countries with limited technological infrastructure may find that their online information is predominantly available in English, rather than their native language, as seen in the Philippines and Mongolia.

Some languages have a variety of pronunciations without standardized characters, such as Minnan, a dialect spoken in southern parts of China.

Other languages are fragmented into numerous dialects. In Indonesia, for example, there are more than 300 dialects, which increase the complexity and diversity of the language.

These challenges can be overcome as long as clients have the financial resources to collect the necessary data, Shum said.

Asiabots accumulates data from extensive research and non-infringing open-source repositories, he said. Clients also provide data to the company or fund it to conduct on-site data collection.

After collecting the data, Asiabots collaborates with local universities and recruits native language speakers to refine and localize AI solutions, aligning them with regional cultures and legal frameworks to overcome cultural barriers.

Since its inception, Asiabots has expanded its AI's linguistic repertoire over the past seven years to 22 languages, including Indonesian, Filipino, Portuguese and Hindi, as well as less common dialects.

After establishing language capabilities, the company tailors AI software and hardware to meet specific customer requirements.

For instance, for the Okinawa tourist spot, Asiabots developed an AI translator capable of translating among five languages: Japanese, Chinese, English, Korean and Vietnamese. These languages can also be interchanged with any of the company's 22 language libraries when required, Shum said.

Endangered languages

While commercial demand ensures the survival of languages with a large offline population, those with few speakers, limited commercial interest, and insufficient technological research are at risk of becoming endangered both online and offline, Chan warned.

UNESCO has a classification system for endangered languages. Ones spoken across all age groups and contexts are considered safe, while languages that children no longer learn as their mother tongue are considered endangered. Those spoken solely by grandparents are in extreme peril, and those lacking speakers face extinction.

Based on this definition, even language dialects that are spoken by substantial populations, like Minnan and Hakka, which is primarily used in southern China, face a fight for survival as fewer young people are learning them.

Shum said not preserving an endangered language could lead to a deep sense of regret.

"There are various research directions in AI and we opted to delve into language study from the start, because behind each language lies a unique mode of thought and a profound reservoir of human wisdom," Shum said.

For instance, the Minnan term describing tears as "falling water" reflects a beautiful perspective. Losing such ways of thinking and expression is a loss of culture, and possibly even civilization, Shum said.

Chan said that language is a crucial vessel of intangible cultural heritage, showcasing the history, customs, habits and social relationships of a region, while forming a part of people's individual and collective identity.

"Protecting the cultural value of a language is much more urgent than its commercial worth, yet it often receives inadequate attention," he said.

By preserving the voice and text of a language through a language model, even if the original speakers disappear, people can access its nuances and written form and learn it whenever they want, Chan said.

Money talks

With hundreds of indigenous languages in Africa at risk of extinction, Votee has worked with clients on the continent to assist in language preservation efforts. However, significant challenges stem from Africa's political instability, limited technological proficiency and insufficient technology infrastructure.

In recent years, many clients have asked Asiabots to develop language models for the preservation of endangered languages.

However, all these projects faltered due to a lack of funding for data collection, such as sending researchers into remote mountainous regions to record voices, and process and digitize these recordings, which might cost millions of dollars.

Francis Fong Po-kiu, honorary president of the Hong Kong Information Technology Federation, said that the governments of smaller language communities should recognize the cultural value inherent in these languages.

Chan proposed that global tech firms, language-focused NGOs, linguists and language enthusiasts collaborate to form communities for mutual support and to encourage the contribution of open-source language data.

When developing its Cantonese LLM, Votee collaborated with Cantonese linguists and enthusiasts to establish a Cantonese-centered community. Subsequently, it open-sourced all the data and models within the LLM.

"Cantonese belongs to everyone, not just a select few — it already lacks resources, so why create additional boundaries?" Chan said.

In July this year, SenseTime, an AI software company in Hong Kong, launched a Thai-language LLM.

Lu Lewei, director of the SenseTime Research Institute, said that they paid attention to minor languages because equipping AI with multilingual capabilities is also good for its own improvement.

More importantly, AI was designed to assist humanity, and its future should prioritize broader accessibility and use, and not neglect some groups, Lu said.

"I believe this is the original intent, also the ultimate goal of humanity's pursuit of technological advancement," Lu said.

Top
BACK TO THE TOP
English
Copyright 1994 - . All rights reserved. The content (including but not limited to text, photo, multimedia information, etc) published in this site belongs to China Daily Information Co (CDIC). Without written authorization from CDIC, such content shall not be republished or used in any form. Note: Browsers with 1024*768 or higher resolution are suggested for this site.
License for publishing multimedia online 0108263

Registration Number: 130349
FOLLOW US
CLOSE
 
精品国产精品| 3d成人h动漫网站入口| 国产精品久久久久一区二区三区 | 四虎影院影音| 97在线看免费观看视频在线观看| 久久久久99精品久久久久| 亚洲人成电影在线播放| 精品性高朝久久久久久久| 精品国产欧美一区二区| 日韩女优电影在线观看| 91麻豆精品91久久久久久清纯| 欧美三级一区二区| 欧美午夜一区二区三区免费大片| 一本久道中文字幕精品亚洲嫩| 精品美女久久久久久免费| 亚洲国产成人av好男人在线观看| 亚洲免费观看在线观看| 亚洲视频小说图片| 一区二区三区欧美| 亚洲自拍偷拍欧美| 精品成人乱色一区二区| 精品国产乱码久久久久久婷婷| 亚洲高清不卡在线观看| 性做久久久久久免费观看欧美| 亚洲狠狠爱一区二区三区| 亚洲人成网站影音先锋播放| 一区二区三区在线看| 一区二区三区中文字幕电影 | 日本亚洲不卡| 神马午夜伦理不卡| 黄色成年人视频在线观看| 日本精品在线| 亚洲卡一卡二| 91桃色在线观看| 性欧美freesex顶级少妇| 神马午夜在线视频| 日韩精品影片| 日日夜夜一区| 粉嫩久久久久久久极品| 在线观看欧美理论a影院| re久久精品视频| 成人精品影视| 午夜久久99| 亚洲综合国产| 韩国v欧美v日本v亚洲v| 国产伦精品一区二区三区在线观看| 国产精品456| 99久久精品国产导航| 久久理论电影网| 国产精品久久一级| 亚洲一区二区三区自拍| 午夜久久久久久久久久一区二区| 欧美日韩一区二区精品| 欧美日韩大陆一区二区| 日韩欧美一二三四区| 日韩av在线免费播放| 一本色道久久综合狠狠躁篇的优点 | 久久狠狠婷婷| 国产一区欧美一区| 99久久99久久精品免费看蜜桃| 久久久国产精华| 亚洲欧美色综合| 欧美性猛交xxxx免费看久久久| 欧美日韩另类一区| 亚洲第一福利网站| 俺去亚洲欧洲欧美日韩| 97精品久久久中文字幕免费| 欧美人与z0zoxxxx特| 任你操视频在线观看| 8x8x视频在线| 激情小视频在线| 色网在线观看| 四虎精品在线观看| 婷婷成人在线| 欧美黄色精品| 久久成人精品无人区| 99久久免费视频.com| 综合亚洲深深色噜噜狠狠网站| 欧美日韩国产在线看| 日韩女优视频免费观看| 日韩网站免费观看| 男女啪啪无遮挡| 拍拍拍在线观看视频免费| 超碰在线图片| 黄色在线播放网站| 成人不卡视频| 啪啪亚洲精品| 国产毛片一区| 成人h版在线观看| 一区二区三区四区亚洲| 欧美日韩一本到| 亚洲美女又黄又爽在线观看| 久久久久久com| 一分钟免费观看视频播放www| 亚洲精品少妇久久久久久 | 三级不卡在线观看| 99国产欧美另类久久久精品| 亚洲一二三四区不卡| 欧美一级一区二区| 久久久国产精彩视频美女艺术照福利| 欧美专区日韩| 黄色三级电影网| 成人黄色网址| 欧美a级大片在线| 亚洲精品成人影院| 国产一区二区三区免费看 | 蜜臀91精品国产高清在线观看| 伊人久久大香线蕉av超碰演员| 国产真实精品久久二三区| 中文字幕一区二区三区四区| 精品视频一区二区不卡| 伊人一区二区三区久久精品| 香蕉久久成人网| 成年人羞羞的网站| 性xxxxfjsxxxxx欧美| 亚洲成av人片在线观看www| 欧美激情日韩| 成人黄色大片在线观看 | 亚洲丝袜一区在线| 天堂精品高清1区2区3区| av线上观看| 日韩激情av| 黄色欧美网站| 三级欧美在线一区| 国产精品国产精品国产专区不蜜| 欧美欧美欧美欧美首页| 欧美日韩成人在线播放| 成年人在线免费| 欧美黑人激情| 日韩一区二区三区精品视频第3页| 欧美日韩亚洲三区| av男人天堂一区| 日韩欧美在线网址 | 在线性视频日韩欧美| 好妞色妞国产在线视频| 亚洲mv在线| 97精品国产综合久久久动漫日韩| 日本一区二区三区视频| 国产一区二区精品久久99| 亚洲一卡二卡三卡四卡五卡| 日韩精品极品在线观看播放免费视频| 影音先锋中文字幕在线| 国产网友自拍电影在线| 美女18一级毛片一品久道久久综合| 日产精品一区二区| 成人av资源在线观看| 欧亚洲嫩模精品一区三区| 久久躁日日躁aaaaxxxx| jizz在线播放| aa级大片免费在线观看| 精品国内自产拍在线观看视频| 激情久久五月天| 天天综合天天做天天综合| 少妇久久久久久| 黄色小视频免费看| 亚洲资源一区| 欧美日韩亚洲在线观看| 国产成人精品综合在线观看| 色8久久精品久久久久久蜜| 欧美大成色www永久网站婷| www.麻豆| 制服丝袜专区在线| 欧美一区网站| 亚洲国产精品成人综合色在线婷婷| 精品美女在线播放| 国内精品不卡一区二区三区 | 91久久久久久白丝白浆欲热蜜臀| 欧美天天在线| 国产精品成人免费在线| 日韩久久免费电影| 天堂网在线.www天堂在线| 国产丝袜在线| re久久精品视频| 91亚洲国产成人精品一区二三| 91精品国产91热久久久做人人| 久久久伊人欧美| 天堂аⅴ在线最新版在线| 亚洲综合资源| 日本视频在线一区| 色综合天天在线| 欧美黑人疯狂性受xxxxx喷水| 久久日一线二线三线suv| 在线观看欧美日本| 久久久久久久久久久av| 亚洲精品第一国产综合野草社区| 视频精品导航| 日韩视频在线一区二区三区 | 黄色av资源| 成人精品国产亚洲| 视频在线在亚洲| 好吊成人免视频| 97人人模人人爽人人喊中文字| 在线观看国产v片| 99re8这里有精品热视频免费 | 精品在线手机视频| jizz一区二区| 精品乱人伦小说| 欧洲一级毛片| 在线精品亚洲欧美日韩国产| 一本综合久久| 精品久久中文字幕| 性日韩欧美在线视频| 免费福利在线观看| 蜜臀av免费一区二区三区| 久久亚洲二区三区| 国产丝袜一区视频在线观看| 777丰满影院| 视频欧美精品| 激情综合五月婷婷| 91精品免费在线观看| xxxxwwww欧美| 欧美gay视频| 日本怡春院一区二区| 欧美在线free| 国产老肥熟xxxx在线观看| 91超碰在线| 亚欧美中日韩视频| 欧美视频日韩视频| 国产卡二和卡三的视频| 美女高潮在线观看| 免费视频一区| 欧美日韩一本到| 国产超级av| 日本韩国欧美| 狠狠色综合播放一区二区| 在线观看不卡| 国产精品一卡二| 91精品麻豆日日躁夜夜躁| xx00欧美| 日韩三区免费| 久久精品av麻豆的观看方式| 欧美日韩国产大片| 欧美日韩一区二区三区四区五区| 中文天堂最新版本在线观看| 成人看片免费| 在线视频观看日韩| 日韩欧美一区二区三区久久| 中文字幕的av| 人人超在线公开视频| 国产日韩精品视频一区二区三区| 日韩欧美精品网站| 国产一级片子| 日韩精品三区| 国产成人午夜精品5599| 亚洲精品国产精品国产自| 高清国产福利在线观看| 日韩欧美国产精品综合嫩v| 亚洲精品中文字幕在线观看| 午夜精品www| gogo在线高清视频| 国产一区成人| 欧美高清视频一二三区 | 国产精品jvid在线观看| 三级成人在线| 国产成人免费av在线| 国产一区二区三区在线免费观看| 欧美最顶级a∨艳星| 99re6这里只有精品| 图片区小说区区亚洲影院| 欧美人乱大交xxxxx| 四虎4545www精品视频| 国产成人免费xxxxxxxx| 国产亚洲欧美另类中文| 亚在线播放中文视频| 亚洲九九视频| 在线观看一区不卡| 成年女人在线视频| 久久悠悠精品综合网| 亚洲精品写真福利| 亚洲精品国产一区二区在线| 3d欧美精品动漫xxxx无尽| www.亚洲在线| 欧美成人小视频| 岛国中文字幕在线| 日本va欧美va欧美va精品| 亚洲精品福利资源站| 免费观看v片在线观看| 欧美一区影院| 88在线观看91蜜桃国自产| 在线看片地址| re久久精品视频| 色综合久久久网| 先锋影音在av资源看片| 婷婷亚洲精品| 天天综合网天天综合色| 中国xxxx自拍视频| 成人直播在线观看| 亚洲三级理论片| 国产精彩视频在线观看免费蜜芽| 日韩有码欧美| 中文字幕视频一区二区三区久| 久久爱www| 国产精品第一国产精品| 中文字幕欧美激情一区| 久艹在线播放| 在线欧美激情| 亚洲三级电影全部在线观看高清| 久热在线视频精品网站| 中文字幕区一区二区三| 亚洲免费观看高清在线观看| 国产一卡2卡3卡四卡网站| 99精品在免费线中文字幕网站一区| 亚洲免费在线电影| 国产黄色小视频| 久久久伦理片| 欧美视频一区二区三区…| 一级欧洲av| 亚洲国产成人精品女人| 欧美电影一区二区| 蜜臀在线观看| 亚洲永久在线| 国产亚洲aⅴaaaaaa毛片| a级影片在线观看| 国产精品亚洲午夜一区二区三区| 久久中文字幕在线| 日产福利视频在线观看| 久久久国产午夜精品| 欧美老**bbbb毛片| 国产欧美自拍一区| 高潮白浆女日韩av免费看| 69ww免费视频播放器| 欧美午夜国产| 国产婷婷色综合av蜜臀av| 成人欧美在线| 99re在线精品| 欧美另类videosbest视频| 欧美激情影院| 欧美综合在线视频| 丁香花高清电影在线观看完整版| 日韩一级欧洲| 中文精品99久久国产香蕉| 九色porny丨入口在线| 中文字幕精品在线不卡| 国产成在线观看免费视频| 综合亚洲色图| 欧美精品久久99久久在免费线 | 中文字幕日本精品| 日韩精品av| 国产精品素人一区二区| 91超碰在线观看| 欧美oldwomenvideos| 精品国产乱码91久久久久久网站| 伊人免费在线| gogogo免费视频观看亚洲一| 久热中文字幕在线观看| 欧美美女在线直播| 欧美人狂配大交3d怪物一区| 二区三区在线播放| 国产suv精品一区二区三区| 一区免费观看| 天堂成人娱乐在线视频免费播放网站 | 午夜国产精品视频免费体验区| 亚洲国产精品久久| 日韩伦理av| 国产精品欧美一区喷水| 9l视频自拍蝌蚪9l视频| 亚洲日本激情| 久久精品国产成人| 国产一区高清| 欧美视频在线视频| 色视频在线观看福利| 国产精品69毛片高清亚洲| 亚洲精品乱码电影在线观看| 九九久久婷婷| 亚洲成色777777女色窝| 后进极品白嫩翘臀在线播放| 国产精品美日韩| 天干夜天天夜天干天ww| 亚洲一区不卡| 国语自产偷拍精品视频偷| 高清精品视频| 91精品国产91综合久久蜜臀| 国产理论在线观看| 国产亚洲一区二区在线观看| 你懂的网站在线观看| 欧美全黄视频| 粗暴蹂躏中文一区二区三区| 99精品女人在线观看免费视频 | h网站在线看| 亚洲一区久久| 午夜精品久久久久久久男人的天堂 | 亚洲国产99精品国自产| 乱馆动漫1~6集在线观看| 一区二区三区日韩精品| 亚洲小说区图片区情欲小说| 国产精品综合久久| 国产又猛又粗| 欧美成人高清| 美女黄色丝袜一区| 免费成人三级| 亚洲精品第一国产综合精品| 波多野结衣亚洲一二三| 欧美色另类天堂2015| 国产女主播在线直播| 久久这里只有精品6| 免费色片视频| 久久精品九九| 最近免费中文字幕mv视频| 99国产**精品****| 久久精品99无色码中文字幕 |