Close Menu
  • Home
  • About
  • News
  • Awards
  • Media & Press
  • Video Podcasts
  • Magazines
  • Events
  • Contact
Facebook X (Twitter) Instagram
Gazet International – Global Magazine
AWARD NOMINATION
  • Home
  • About
  • News
  • Awards
  • Media & Press
  • Video Podcasts
  • Magazines
  • Events
  • Contact
You are at:Home » Inception and MBZUAI Launch AraGen Leaderboard with First Generative Tasks for Arabic LLM ecosystem
Press Release

Inception and MBZUAI Launch AraGen Leaderboard with First Generative Tasks for Arabic LLM ecosystem

By December 6, 20244 Mins Read
Facebook Twitter LinkedIn
Share
Facebook Twitter LinkedIn

Inception, a G42 company specializing in AI-native products, in collaboration with the Mohamed bin Zayed University of Artificial Intelligence (MBZUAI) today announced the launch of AraGen Leaderboard, a framework designed to redefine the evaluation of Arabic Large Language Models (LLMs). Powered by the new internally developed 3C3H metric, this framework delivers a transparent, robust, and holistic evaluation system that balances factual accuracy and usability, setting new standards for Arabic Natural Language Processing (NLP).

Inception and MBZUAI Launch AraGen Leaderboard with First Generative Tasks for Arabic LLM ecosystem

Serving over 400 million Arabic speakers worldwide, the AraGen Leaderboard addresses critical gaps in AI evaluation by offering a meticulously constructed evaluation dataset tailored to the unique linguistic and cultural intricacies of the Arabic language and region. The dynamic nature of this leaderboard tackles challenges such as benchmark leakage, reproducibility issues, and the absence of holistic metrics to evaluate both core knowledge and practical utility.

The introduction of generative tasks represents a groundbreaking advancement for Arabic LLMs, offering a new dimension to the evaluation process. Unlike traditional leaderboards that primarily focused on static, likelihood accuracy-based benchmarks, which fail to capture real-world performance, AraGen’s Leaderboard addresses these limitations. This highlights the transformative impact of the new benchmark in fostering AI innovation and enhancing model performance.

“The AraGen Leaderboard redefines Arabic LLM evaluation, setting a new standard for fairness, inclusivity, and innovation,” said Andrew Jackson, CEO of Inception. “By addressing the gaps in previous benchmarks and introducing generative tasks, the platform empowers researchers, developers, and organizations to create culturally aligned AI technologies. AraGen ensures transparency, reproducibility, and trust while advancing the global NLP landscape.”

The AraGen Leaderboard evaluates models across six dimensions: correctness, completeness, conciseness, helpfulness, honesty, and harmlessness. Featuring 279 questions across tasks like Arabic grammar, general Q&A, reasoning, and safety, it prioritizes the needs of Arabic speakers. Quarterly updates keep the leaderboard relevant while inviting public submissions to enhance model refinement and foster growth in the Arabic AI ecosystem.

“AraGen is a major step towards open, collaborative, and reproducible evaluation of large language models for Arabic, with focus on their text generation capabilities. This contrasts with popular leaderboards, which rely primarily on multiple-choice questions. Moreover, AraGen is a dynamic board with new questions every three months, which makes it much harder to game compared to existing leaderboards,” said Professor Preslav Nakov, Department Chair of Natural Language Processing and Professor of Natural Language Processing, Mohamed bin Zayed University of Artificial Intelligence (MBZUAI)

“Our goal was to create a benchmark that introduces generative task evaluation with a strong emphasis on transparency, reproducibility, and a rigorous measurement of models’ performances,” said Ali El Filali, Machine Learning Engineer at Inception and lead author of this work. “By evaluating models across multiple dimensions to assess both factuality and usability, the AraGen Leaderboard provides actionable insights for diverse NLP tasks. This empowers the Arabic AI community to develop safe and high-performing models for real-world needs that are important to our region. Moreover, AraGen sets a global example by demonstrating how AI benchmarks can prioritize equity and inclusion for underrepresented languages. It’s a step toward ensuring no language or culture is left behind in the AI revolution.”

The Leaderboard delivers detailed performance insights, enabling organizations to confidently select models that align with their requirements. By reducing the need for extensive internal testing, AraGen ensures cost- effectiveness for organizations through a more suitable metric for LLM evaluation, while strengthening trust through its transparent and reproducible methodology.

For more information about the AraGen Leaderboard and submission guidelines, visit.

About Inception

Inception, a G42 company, builds AI-native products that leverage cutting-edge AI research, models, and systems applied to business problems. We pioneer domain-specific AI applications, to deliver AI-driven solutions, across languages and sectors.

About Mohamed bin Zayed University of Artificial Intelligence (MBZUAI)

MBZUAI is a graduate research university focused on artificial intelligence, computer science, and digital technologies across industrial sectors. The university aims to empower students, businesses, and governments to advance artificial intelligence as a global force for positive progress. MBZUAI offers various graduate programs designed to pursue advanced, specialized knowledge and skills in artificial intelligence, including computer science, computer vision, machine learning, natural language processing, and robotics.

For more information, please visit www.mbzuai.ac.ae.

To apply for admission, visit mbzuai.ac.ae or contact admission@mbzuai.ac.ae.

Share. Facebook Twitter LinkedIn
Previous ArticleALUCAST 2024 Draws 8,000 Industry Visitors from 20+ Countries, Featuring 200 Exhibitors and 300+ Brands​
Next Article HITEK Launches Bespoke Housekeeping App for the Hospitality Sector

Related Posts

Farnek Launches Transformational Hybrid Cleaning Unit

May 15, 2025

Pepe Jeans Powers up in Jaipur with its Biggest Indian Store Yet​

May 15, 2025

Offering Credit Intelligence: OneScore Ushers in the New Era of Personal Loans​

May 15, 2025
  • Facebook
  • Twitter
  • Instagram
  • YouTube
  • LinkedIn
Don't Miss

Farnek Launches Transformational Hybrid Cleaning Unit

Pepe Jeans Powers up in Jaipur with its Biggest Indian Store Yet​

Offering Credit Intelligence: OneScore Ushers in the New Era of Personal Loans​

Oliver Healthcare Packaging Opens State-of-the art Manufacturing Facility in Johor to Meet the Needs of Pharmaceutical and Medical Device Companies in Asia-Pacific​

Recent Posts
  • Farnek Launches Transformational Hybrid Cleaning Unit
  • Pepe Jeans Powers up in Jaipur with its Biggest Indian Store Yet​
  • Offering Credit Intelligence: OneScore Ushers in the New Era of Personal Loans​
  • Oliver Healthcare Packaging Opens State-of-the art Manufacturing Facility in Johor to Meet the Needs of Pharmaceutical and Medical Device Companies in Asia-Pacific​
  • Chintamanis Group Sets a New Benchmark in Timeless Luxury Living​
Recent Comments
    Archives
    • May 2025
    • April 2025
    • March 2025
    • February 2025
    • January 2025
    • December 2024
    • November 2024
    • October 2024
    • September 2024
    • August 2024
    • July 2024
    • June 2024
    • May 2024
    • April 2024
    • March 2024
    • February 2024
    • January 2024
    • October 2023
    • September 2023
    • January 2021
    Categories
    • Banking
    • Blog
    • Business
    • Corporate
    • Editor's Column
    • Events
    • Executive Spotlight
    • Finance and Investing
    • Lifestyle
    • magazine
    • podcast
    • Press Release
    • Technology
    • World
    Meta
    • Log in
    • Entries feed
    • Comments feed
    • WordPress.org
    About

    GAZET INTERNATIONAL


    Gazet International Magazine is a global entity that works towards providing latest information and news updates of the world. It entraps latest stories in banking, finance, lifestyle and various beats of the world. We engage in recognizing and rewarding the global organizations for their achievements in various fields and deliver justice to the nominees with valued identification and recognition of companies that indulge in the Gazet Award Ceremony.

    Facebook X (Twitter) Instagram YouTube LinkedIn
    Categories
    • Banking
    • Blog
    • Business
    • Corporate
    • Editor's Column
    • Events
    • Executive Spotlight
    • Finance and Investing
    • Lifestyle
    • magazine
    • podcast
    • Press Release
    • Technology
    • World
    Latest posts
    Finance and Investing

    Unemployment rate in the UK much lower than expected in late 2023

    February 5, 2024
    Corporate

    Lufthansa union calls for strike on Wednesday

    February 5, 2024
    Finance and Investing

    Paytm’s crackdown by RBI sends market value down $2.5 billion

    February 5, 2024
    Finance and Investing

    A modest slowdown in US job growth is expected in January

    February 2, 2024
    Previous 1 … 687 688 689 690 691 … 726 Next
    Official Partner

    7ITS NEWS

    Copyright © 2025. Gazet International

    Type above and press Enter to search. Press Esc to cancel.