Research Online Results

All charts and visualisations are created using Lava Metrics.

Early access to our Beta? 👉 Sign up here
 
Marketing AI Performance Leaderboard - June 2025 Results

Research Online Results


Research Online Scores by LLM

🔗 See full results dashboard

What are the overall results?

🏆 Research Online Winner: Gemini: 2.5-Flash-Preview
❌ Research Online Loser: Qwen: qwen-max
 

Individual Test Winners and Losers

Losers by Category ❌

🔗 See full results dashboard
 

Winners by Category 🏆

🔗 See full results dashboard
 

Test Ranking (Best → Worst)

Ranking reflects the average performance per 'research online' test of all LLMs ordered highest to lowest.
  1. Buyer Person Developement
  1. Industry Overview Report
  1. Content Gap Analysis
  1. Competitor Analysis
  1. Market Opportunities and Threats
 

FAQs

What does this Leaderboard represent?
We have designed tests that simulate a marketer’s interaction with native platform UIs (e.g., ChatGPT, Gemini) across several marketing domains:
 
  • Copywriting: Generating ad copy, email subject lines, and social media posts.
  • Internal Data Analysis: Interpreting sample CRM data to identify trends and insights.
  • Strategic Planning: Creating marketing plans based on given scenarios.
  • Online Research: Gathering information from the web to support marketing decisions.
 
How were the tests scored?
Each test output is evaluated by specialised AI “judges.”
 
  • Judges are themselves AI agents configured with specific evaluation criteria.
  • They parse the Test Answer, compare it against expected outcomes or benchmarks, and score on multiple dimensions (e.g., factual correctness, tone, format).
  • Final scores are normalized and aggregated to produce a single value per test.
Where can I see the full results?