7B parameters on-premise large language model

About the customer

Our customer is a prominent institution facilitating the integration of AI capacities across various organizations.

The task: Our primary goal was to design a robust large language model with the ability to function on-premise. It was required to incorporate the most recent country-specific data, add a particular language to operate in, and consider the target country’s cultural nuances and legal specifics. According to the deadline, all tasks must be completed within a month.

Embrace innovation with Global Cloud Team’ bussiness competence and services

Our Approach:

Our dedicated team quickly embarked on the task, parsing through terabytes of country-specific data, and scraping billions of web pages. This meticulous attention to detail was crucial in ensuring the model’s training material included up-to-date, language-specific information reflecting the cultural context accurately.

We curated over 60,000 high-quality bilingual training instructions, providing an all-encompassing coverage of key training topics. Understanding the importance of legal compliance and cultural appropriateness, we prepared several auxiliary datasets. These datasets were primarily focused on fine-tuning the mode to maximize the necessary alignment. The model was trained on 1 trillion tokens, using 440 GPUs.

The team deployed a proprietary state-of-art retriever model to ensure that a real-time search to any connected data source is possible and that the data is seamlessly counted in as a part of the prompt before the generator creates a reply.

Results Delivered:

Despite a tight deadline, the team completed the tasks on time. After several stages of training and testing, we reached the planned results. Within a single month, we created an on-premise large language model tailored to meet the customer’s specific needs. The model stands out for its superior quality, up-to-date information, cultural appropriateness, and legal compliance. Our solution has surpassed GPT 3.5 on six benchmarks in the country language and has been successfully deployed, and is now in active use.

Team

We have extensive experience in the development of highly scalable robust distributed platforms. As an example, the largest project was developed by multiple collaborating Outstaff Teams within GCT employing over 70 engineers.

The developed financial services platform supports up to 5 thousand updates per second and serves millions of end-users.

We believe that it takes great people to deliver a great product. top-reasons-first

team
team
team
team

I am here to help you!

Explore the possibility to hire a dedicated R&D team that helps your company to scale product development.

Please submit the form below and we will get back to you within 24 - 48 hours.

Global Cloud Team Form Global Cloud Team Form

Our scalable workforce is specializing in the following areas of software development

Image Line

Revolutionize manufacturing processes and increase productivity with our innovative software solutions

When it comes to developing software for the financial sector, cooperate with GlobalCloudTeam

We have the skills, experience, and resources to develop even the most complex healthcare solution

Strengthen your market position with GlobalCloudTeam eCommerce solutions

Drive innovation in the automotive industry with cutting-edge software development services from GlobalCloudTeam

Explore our solutions

Image Line
How can we help: – Custom Large Language Models (LLMs) training. Get your proprietary on-premise ChatGPT-like model with up-to...
Today AI and machine learning are powerful tools for decision-making, analytics, or automation of manual processes. Their advanced a...
NLP, machine learning, and AI are not new in the IT market. Now they are showing expanding popularity and attracting new companies w...
Explore All