Bing Releases Intelligent Question-Answering Feature to 100+ Languages

Intelligent question-answering is one of the most useful and delightful features of search. As a user, you ask a question (e.g., “what are the benefits of eating apricots”) and can get the answer directly (e.g., info about health and nutrition benefits of apricots) at the top of the page without further need to search for relevant content by yourself. The feature aims to direct users to the most concise and precise answers from web documents, thus saving users time and efforts.  

English-language question answering from web has been enabled on Bing for several years, and another dozen of languages, like French and German, have been added within the last year. But our work isn’t done - there are thousands of languages in the world! Not all of them have rich enough web content to derive good answers, but for those that do, uses of those spoken languages deserve the same useful, delightful, time-saving experience.

Recently, Bing expanded its intelligent question-answering feature to more than 100 languages, making AI and Bing itself more inclusive and accessible. What is amazing is this is achieved by using a language agnostic approach. In other words, the AI model generating the intelligent question-answering in Urdu is the same one generating the intelligent question-answering in Romanian.  Here are some examples of this experience in various languages (if you speak a language other than English, feel free to give it a try, but be reminded to set your browser to the relevant language):

Bing's intelligent question-answering feature in many different languages

Here are search results in different languages for “what are the benefits of eating apricots”:
Example 1 – { what are the benefits of eating apricots }
Example 2 – { hva er fordelene med å spise aprikoser }
Example 3 – { какие витамины есть в абрикосе }
Example 4 – { kayısı yemenin faydaları nelerdir }
Example 5 – { خوبانی کھانے کے کیا فوائد ہیں؟ }
Example 6 – { నేరేడు పండు తినడం వల్ల కలిగే ప్రయోజనాలు ఏమిటి }
 
The magic behind this language agnostic experience is the advanced universal deep pretrained models for natural language understanding and the combination of real-time and near real-time GPU model inference to scale question-answering to markets all around the world.
 

Universal Deep Models for 100+ Languages


Recently, multilingual pre-trained models such as M-BERT, Unicoder, XLM-Roberta and Turing Universal Language Representation model (powered by Microsoft latest cross-lingual innovation InfoXLM) have been developed to learn multilingual language representations by leveraging large-scale multilingual corpuses for cross-lingual pre-training. Those powerful multilingual models are capable of zero-shot or few-shot cross-lingual transferring capabilities. However, when shipping the models to 100 languages, the performance on low resource languages may still have a big gap with rich resource languages (such as English). To further close the gap, a series of cross-lingual techniques have been developed such as task/domain adaption, user feedback-based question answering, data augmentation, etc. Powered by these SOTA cross lingual technologies, our question answering system becomes truly language agnostic.
 

Serving Universal Models at Scale  


Universal models are difficult to serve at web search scale because common approaches to reduce model complexity like distillation can lead to lower quality results. This is especially problematic for question-answering where precision is essential to ensure a positive user experience. However, without reducing model complexity, real-time model inference can take longer than would be acceptable for search engines where users expect fast results.

To address this serving challenge, we augmented our existing real-time question-answering models with a near real-time inference system. This system efficiently runs sophisticated universal models that would typically be too slow for users to generate high quality answers. These answers will be immediately available the next time a similar question is asked.  By combining both instantaneous and near real-time model inference, we’re able to provide more users with direct answers to their questions in the languages they speak. 
 

Summary


We successfully scaled Bing intelligent question-answering feature to over 100 languages and 200 regions in the world. Existing language understanding techniques and platform evolution have made a big break-through for more natural and language agnostic searching experience. We do believe there is still space to continue to improve answers to users’ questions, so stay tuned as there is more innovation to come.

Beyond question-answering, Bing and Microsoft researchers released XGLUE cross lingual benchmark, which aims to help the research community further advance language-agnostic models and make AI systems more inclusive.

Please give Bing intelligent question-answering a try with your language and region setting. We would love to hear your feedback or suggestions! You can give feedback by using either the thumbs up/down button below the answers or the feedback button in the bottom right corner of the search results page.

feedback button on Bing search results page