HOME > Business Wire > Article
PKSHA develops advanced Large Language Models in collaboration with Microsoft Japan
Featuring rapid communication capabilities, this new LLM generates answers 3 times faster than conventional LLMs, and is aimed towards contact centers and corporate help desks
TOKYO--( BUSINESS WIRE )-- PKSHA Technology Inc. (TOKYO:3993) has developed one of the first Japanese-English Large Language Models (LLM) using Retentive Network (RetNet) (*1) in collaboration with Microsoft Japan Co., Ltd. Through this LLM development, PKSHA will further enhance the practicality of generative AI in the business world, primarily focusing on boosting productivity within contact centers and corporate help desks. Actual operation in business environments will begin in stages from April 2024.
Overview of PKSHA's LLM: First Japanese-English LLM using 'RetNet'
PKSHA has developed a new LLM with the following features, leveraging Azure’s AI Infrastructure and technical assistance from Microsoft Japan.
This model is the world's first (*2) Japanese-English LLM using RetNet, which is anticipated to be a leading successor to the widely-used Transformer. RetNet, developed by Microsoft Research Asia, has a fast learning speed. It also excels in inference speed and memory efficiency when processing long text input, while maintaining or exceeding the accuracy of traditional models. The superior memory efficiency means that the model can run on fewer GPUs (*3) than conventional models, making it more cost effective. This architecture enables our Japanese-English LLM to combine efficiency in long text comprehension with excellent response speed.
This is a 7B parameter model, a size that balances output accuracy and operating cost for implementation in contact centers and corporate help desk operations.
For example, when inputting the text from two pages of a Japanese newspaper (*4), this model can output a response approximately three times faster than a conventional model, without compromising on accuracy. The model's efficiency improves proportionally with the volume of input information.
The LLM has been developed using DeepSpeed (*5), a deep learning framework developed by Microsoft Research. Microsoft provided the RetNet modeling expertise and Azure’s purpose built AI Infrastructure virtual machines optimized for AI workloads, to take advantage of DeepSpeed’s strength - highly parallel and distributed processing capabilities. With RetNet and DeepSpeed, we were able to efficiently train and achieve early performance validation with a prototype model.
The benefit of instant answers will transform contact center and corporate help desk operations
Founded in 2012, PKSHA has been focusing on the research and development of natural language processing (NLP) and the implementation of AI in society, mainly in the field of communication. With a strong track record of more than 6,000 AI implementations, mainly in areas of contact centers and corporate help desks, PKSHA will promote the use of this LLM to achieve further deployment in these areas. Our model is developed on the premise that it will be implemented in the business environment based on a practical understanding of the customer's challenges.
After further verification and refinement, the new LLM is planned to be gradually implemented in actual business environments later this spring. PKSHA regards this LLM as a powerful business asset, and aims to combine it with PKSHA’s other technologies to create a prosperous society in which people's abilities are maximized through the collaboration of people and software.
Comment from Hiromichi Nozaki, Managing Executive Officer and Chief Technology Officer of Microsoft Japan Co., Ltd.
“Microsoft Japan warmly welcomes the development of a new large-scale language model by PKSHA Technology Inc., and its application to these important customer service experiences. In particular, by utilizing RetNet and DeepSpeed to achieve both Japanese and English LLMs, PKSHA can improve the immediacy and accuracy of communication,” said Hiromichi Nozaki, Managing Executive Officer and Chief Technology Officer of Microsoft Japan Co., Ltd. “As we strive to accelerate digital transformation through technological innovation and contribute to the realization of a prosperous society, Microsoft Japan will continue to support PKSHA's vision of 'co-evolution of humans and software' and its innovative efforts to realize this vision.”
*1: RetNet is a technology that is expected to be the successor to Transformer, as it is reported to be a model with the following features compared to Transformer, which is used in almost all LLMs as of March 2024.
- Language performance equivalent to or better than Transformer.
- Fast learning by parallel execution.
- Memory-saving and low-latency inference.
For more information, please see the following research paper:
https://www.microsoft.com/en-us/research/publication/retentive-network-a-successor-to-transformer-for-large-language-models/
*2: According to our own research on open-source models published on Hugging Face, a global platform providing machine learning models and other NLP-related tools and resources.
*3: Graphics Processing Unit. A special device that performs high-speed parallel processing and is widely used in fields such as graphics processing, scientific computing and machine learning.
*4: Assuming approximately 10,000 characters per newspaper page.
*5: DeepSpeed enables world's most powerful language models. It is an easy-to-use deep learning optimization software suite that powers unprecedented scale and speed for both training and inference.
- Reshape Large Model Training Landscape with scale, efficiency, ease
- Optimize Large Model Inference of scale, latency, cost
- Speed Up Inference & Reduce Model Size with compression for efficiency and cost savings.
- DeepSpeed4Science: Microsoft's AI to unlock science mysteries
For more information, please see the following link:
https://www.deepspeed.ai/
View source version on businesswire.com: https://www.businesswire.com/news/home/20240418370582/en/
Source: PKSHA Technology Inc.
Business Wire
- 05/17 12:39 Cleaver-Brooks Acquired by Miura Co., Ltd.
- 05/17 02:09 Accenture to Acquire OPENSTREAM HOLDINGS to Help Clients Advance Their...
- 05/16 16:00 The Baking Action RPG “Aeruta” Launched in Early Access on PC via ...
- 05/15 23:30 Oasis Calls for Improved Shareholder Return & Votes Against Chairm...
- 05/15 15:00 Esperanto Technologies and Rapidus Partner to Enable More Energy-Effic...
- 05/15 13:31 CORRECTING and REPLACING PHOTO ADDING MULTIMEDIA Isuzu Invests US$30 M...
- 05/15 13:00 Asahi Kasei Announces Port Colborne, Ontario, Canada as Location of Fu...
- 05/15 10:23 MUFG Bank, Ltd. announces Consolidated Summary Report [under Japanese ...
- 05/15 09:00 Esperanto Technologies and Rapidus Partner to Enable More Energy-Effic...
- 05/15 08:00 Intergamma and Hanshow Forge Strategic Alliance: Pioneering Digital Re...
- 05/15 05:00 Shin-Etsu Chemical to Build a New Plant for Silicone Products in Zheji...
- 05/15 02:10 Mitsubishi Electric and Musashi Energy Solutions Sign Partnership and ...
- 05/14 23:30 Upstream Welcomes ex-Fujitsu Executive to Expand Operations in Japan
- 05/14 21:00 Stonepeak and CHC Form Japanese Battery Energy Storage Platform
- 05/14 15:49 Nexon Releases Earnings for First Quarter 2024
- 05/14 14:05 Saghmos Therapeutics Announces Issuance of Patent in Japan for Phase 3...
- 05/14 14:03 Rapid Medical™ Completes Successful Ischemic Stroke Procedures with ...
- 05/14 12:30 Asahi Kasei starts operation of multi-module hydrogen pilot plant in K...
- 05/14 12:00 Takeda to Present Oncology Portfolio and Pipeline Data at the 2024 ASC...
- 05/14 11:30 CTCL Global Care Collaborative Pioneers Consensus for Improving Diagno...
- 05/14 07:45 Cvent Announces Top Meeting Destinations and Top Meeting Hotels in Asi...
- 05/14 07:05 ispace EUROPE and CDS Sign Payload Service Agreement to Transport Prec...
- 05/14 03:00 Murata’s New LCT Redefines Power Supply Noise Suppression, Reducing ...
- 05/14 02:00 ANESSA Launching “ANESSA Sunshine Project” in 12 Asian countries/r...
- 05/14 02:00 FPT Software Partners with Siemens to Provide Low-code Platform Mendix...
- 05/13 14:05 NTT DATA to Provide Digital Transformation Services for Salesforce
- 05/13 14:00 Language-Learning Platform "Native Camp" Launches Unlimited Online Jap...
- 05/13 14:00 Helical Fusion and National Institute for Fusion Science Establish 'HF...
- 05/13 13:00 SambaNova Announces That Fugaku-LLM Is Now a Part of Samba-1
- 05/13 12:00 New LRN Research: Gen Z Employees Twice as Likely to Bend the Rules or...