Blog & Articles

Perspectives, news, and technical reports from our community.

AI Alliance Accelerating Open-Source AI Innovation with Llama Stack

We are excited to announce a deeper collaboration between the AI Alliance and Meta’s Llama Stack, marking a significant milestone in advancing open-source AI development. The AI Alliance officially supports Llama Stack as a foundational AI application framework designed to empower developers, enterprises, and partners in building and deploying AI applications with ease and confidence.

DoomArena: A Security Testing Framework for AI Agents

Technical Report

The AI Alliance releases new AI-powered programming language and industrial AI agent framework, adds new Japanese members, and launches AI Alliance Japan  

The AI Alliance announced three developments: Dana, an AI-powered programming language that generates code from natural language descriptions; OpenDXA, an open-source agent framework for industrial AI applications; and AI Alliance Japan, a regional working group with nine founding members, including IBM, NEC, and Panasonic, focused on sovereign AI development. Dana introduces intent-driven development, in which developers describe functionality rather than write traditional code, while OpenDXA targets complex industrial workflows with explainable AI. The Japan initiative will focus on manufacturing, semiconductor, and navigation applications, and its first project supports LLM-jp, Japan's national language model. All projects are open source and available through the AI Alliance.

The AI Alliance Forms Non-profit AI Lab and AI Technology & Advocacy Association to Scale Open-Source Innovation 

New legal entities and boards intend to scale the AI Alliance’s mission to support and perform open-source development, open research, education, and advocacy for AI globally. 

Screenshot AI Alliance Association Statement June 10 2025

AI Alliance Urges Lawmakers to Rethink the NY RAISE Act

News

LLM-as-a-Judge Without the Headaches: EvalAssist Brings Structure and Simplicity to the Chaos of LLM Output Review

Technical Report

Evaluating AI model outputs at scale is a major challenge for teams using LLMs, especially when assessing nuanced qualities like politeness, fairness, and tone that traditional benchmarks miss. IBM Research has released EvalAssist, an open-source tool that streamlines the "LLM-as-a-Judge" approach, allowing teams to define custom evaluation criteria and apply them at scale using models like GPT-4 or IBM's Granite. The platform offers multiple evaluation strategies including direct assessment and pairwise comparison, while providing transparency through chain-of-thought explanations and bias detection. Built on IBM's Unitxt toolkit, EvalAssist aims to make AI evaluation more rigorous, scalable, and trustworthy for real-world applications.
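As a rough illustration of the pairwise-comparison strategy mentioned above, the sketch below asks a judge to compare two responses twice, with the order swapped, and flags a possible position bias when the two verdicts disagree. This is not EvalAssist's API: the `Judge` callable, the prompt wording, and the `polite_judge` stub are all hypothetical stand-ins for a real model call.

```python
from typing import Callable, Optional, TypedDict

# Hypothetical judge interface: takes a prompt, returns "A" or "B".
Judge = Callable[[str], str]


class Verdict(TypedDict):
    winner: Optional[str]
    position_bias: bool


def build_prompt(criterion: str, first: str, second: str) -> str:
    """Assemble a single-line-per-field comparison prompt (illustrative wording)."""
    return (
        f"Criterion: {criterion}\n"
        f"Response A: {first}\n"
        f"Response B: {second}\n"
        "Which response better satisfies the criterion? Answer A or B."
    )


def pairwise_compare(judge: Judge, criterion: str, resp_1: str, resp_2: str) -> Verdict:
    """Compare two responses in both orders; disagreement suggests position bias."""
    forward = judge(build_prompt(criterion, resp_1, resp_2)).strip().upper()
    backward = judge(build_prompt(criterion, resp_2, resp_1)).strip().upper()
    winner_fwd = "resp_1" if forward == "A" else "resp_2"
    # In the swapped prompt, slot A holds resp_2.
    winner_bwd = "resp_2" if backward == "A" else "resp_1"
    if winner_fwd != winner_bwd:
        return {"winner": None, "position_bias": True}
    return {"winner": winner_fwd, "position_bias": False}


def polite_judge(prompt: str) -> str:
    """Stub judge for demonstration: prefers the response containing 'please'."""
    response_a = prompt.splitlines()[1]
    return "A" if "please" in response_a.lower() else "B"


def always_a_judge(prompt: str) -> str:
    """Stub judge that always picks slot A, i.e. a fully position-biased judge."""
    return "A"


consistent = pairwise_compare(polite_judge, "politeness", "Please send the file.", "Send the file.")
biased = pairwise_compare(always_a_judge, "politeness", "Please send the file.", "Send the file.")
```

In practice the stub would be replaced by a call to whichever model serves as the judge. The swap-and-compare check is one common way to surface ordering effects, shown here only as an example of what bias detection can look like.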

Mastering Data Cleaning for Fine-Tuning LLMs and RAG Architectures

News

In the rapidly advancing field of artificial intelligence, data cleaning has become a mission-critical step in the success of Large Language Models (LLMs) and Retrieval-Augmented Generation (RAG) architectures. This post explains why high-quality, structured data matters for preventing model hallucinations, reducing algorithmic bias, enhancing embedding quality, and improving retrieval accuracy. It covers essential preprocessing techniques such as deduplication, PII redaction, noise filtering, and text normalization, and highlights tools such as IBM Data Prep Kit, AI Fairness 360, and OpenRefine. With real-world applications ranging from LLM fine-tuning to graph-based knowledge systems, the post offers a practical guide for data scientists and AI engineers looking to optimize performance, ensure ethical compliance, and build scalable, trustworthy AI systems.
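The four preprocessing steps named above can be sketched in a few lines of standard-library Python. This is a minimal illustration, not the approach of any tool the post mentions: the regular expressions, the `[EMAIL]` and `[PHONE]` placeholders, the three-word noise threshold, and all function names are assumptions chosen for the example. Real PII detection needs far more than two patterns.

```python
import hashlib
import re
import unicodedata

# Deliberately simple, illustrative patterns; they will miss many real-world cases.
EMAIL_RE = re.compile(r"[\w.+-]+@[\w-]+(?:\.[\w-]+)+")
PHONE_RE = re.compile(r"\+?\d[\d\s().-]{7,}\d")


def normalize(text: str) -> str:
    """Text normalization: apply Unicode NFKC and collapse runs of whitespace."""
    text = unicodedata.normalize("NFKC", text)
    return re.sub(r"\s+", " ", text).strip()


def redact_pii(text: str) -> str:
    """PII redaction: replace email addresses and phone-like numbers with placeholders."""
    text = EMAIL_RE.sub("[EMAIL]", text)
    return PHONE_RE.sub("[PHONE]", text)


def is_noise(text: str, min_words: int = 3) -> bool:
    """Noise filtering: treat very short fragments as noise (threshold is an assumption)."""
    return len(text.split()) < min_words


def clean_corpus(docs: list[str]) -> list[str]:
    """Normalize, redact, filter, and drop exact (case-insensitive) duplicates."""
    seen: set[str] = set()
    cleaned: list[str] = []
    for doc in docs:
        text = redact_pii(normalize(doc))
        if is_noise(text):
            continue
        digest = hashlib.sha256(text.lower().encode("utf-8")).hexdigest()
        if digest in seen:  # deduplication by content hash
            continue
        seen.add(digest)
        cleaned.append(text)
    return cleaned


sample = [
    "Contact  me at jane@example.com  today",
    "contact me at jane@example.com today",
    "ok",
    "Call +1 (555) 123-4567 for support now",
]
result = clean_corpus(sample)
```

Hash-based deduplication only catches exact matches after normalization. Near-duplicate detection, such as MinHash, would be a separate step.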

Feedback on the Draft Report by Joint California Policy Working Group on AI Frontier Models

News

AI Alliance Comment in Response to Japan Fair Trade Commission Discussion Paper on Generative AI and Competition

News

AI Alliance Comment in Response to the RFI on the Development of an AI Action Plan

News

The AI Alliance Comment on NIST AI 800-1 Initial Public Draft: “Managing Misuse of Dual-Use Foundation Models”

News

Announcing the Open Trusted Data Initiative (OTDI) draft v0.1 dataset specification
