Highlights Edition

Artificial Analysis State of AI

Q2 2025

Highlights Report

Full report available to AI Trends subscribers

Artificial Analysis is a leading, independent AI benchmarking and insights provider. We support engineers and companies to understand AI capabilities and make critical decisions about their AI strategy.

Our data, insights and publications are grounded in our comprehensive benchmarking of AI technologies and use cases. This includes everything from hourly performance testing of language model APIs to millions of votes in our crowd-sourced arenas.

Our public website, artificialanalysis.ai, is widely referenced by companies leading innovation in AI. To discuss this report, our publications, or our services, please get in touch at contact@artificialanalysis.ai.

Artificial Analysis AI Trends Subscription: Comprehensive AI market intelligence for enterprise decision-making from the leading AI benchmarking company

A | Quarterly State of AI Report
The definitive quarterly update on AI market developments
• Emerging trends at each layer of the AI stack: hardware, infrastructure, models
• Market maps and performance rankings for hundreds of key players
• State of AI: China - detailed benchmarking of top AI labs in China

B | Enterprise Agents Report
2025 is the year of agents: our overview of what matters most
• Comprehensive analysis of key agent categories: coding, deep research, computer use, customer support, sales
• What's working now: where agents are driving real productivity
• Implications for real-world deployment

C | AI Adoption Survey
Real-world adoption insights from those building and deploying AI
• Enterprise use case patterns
• Enterprise adoption benchmarks
• Developer priorities and pain points
• Model, inference and hardware provider demand by industry

D | Databooks & API
Direct access to the industry's most comprehensive data
• Comprehensive AI performance data - source data for all our analysis
• Intelligence, performance, cost, survey data and more
• Excel databooks and API access

E | Quarterly AI Trends Workshop
Connect market intelligence to your strategic priorities
• Live briefing with Artificial Analysis research team (90 minutes, optional)
• What's working now: insights from leading SF startups and enterprises
• Deep dives tailored to your business priorities (e.g., coding agent best practices, inference economics, upcoming chips)

F | Ongoing Team Access
Direct access to our research team for support and clarifications
• Support for use of reports and data, including queries on sources and methodology
• Clarification and explanation of analyses
• Limited to max 90 minutes per quarter, rates available for further support

Artificial Analysis is trusted by the leading AI industry players and publications

Overview of Artificial Analysis AI Trends Subscription

This is the Highlights Version of the Quarterly State of AI Report for Q2 2025; the Premium Version is available to subscribers of our AI Trends Subscription

Highlights Version (This)
• Industry overview and market map of key players and strategies across the AI value chain
• Overview of frontier models ranked by the Artificial Analysis Intelligence Index and overview of emerging trends
• Synthesis of emerging trends for image, video and speech models and market maps
• Synthesis of emerging trends for accelerators including case study comparing NVIDIA B200 and NVIDIA H200 using Artificial Analysis System Load Test

Premium Version (AI Trends Subscription)
Includes everything in the Highlights Version plus:
• Detailed insights across new language model releases (incl. analysis of leading open weights options)
• Detailed analysis and case studies outlining emerging trends for language models across pricing, performance and features
• Analysis of frontier image generation models and trends (incl. text to image and image editing)
• Analysis of frontier video generation models and trends (incl. text to video and image to video)
• Analysis of frontier speech models and trends (incl. text to speech and speech to text)
• Emerging market trends for accelerators, including detailed analysis comparing NVIDIA H200 and NVIDIA B200
• Attached Separate Report: Enterprise Agents Report covering comprehensive analysis of key agent categories and implications for real-world deployment

Feel free to get in touch with us at subscriptions@artificialanalysis.ai to learn more about the Artificial Analysis AI Trends Subscription

Artificial Analysis State of AI Q2 2025

The story of AI in Q2 2025 reveals an industry hitting its stride after years of foundational development. We are witnessing a new phase where innovations across the AI stack are maturing and converging towards impacting how every organization operates.

Today's models demonstrate significant intelligence gains while becoming more cost-effective and faster than ever. Agentic workflows are moving from promising experiments to production reality, with coding agents proliferating across development teams. Meanwhile, the competitive landscape continues to evolve, with Chinese AI labs demonstrating remarkable leadership in both language and video capabilities.

Produced by Artificial Analysis, an independent benchmarking and insights firm trusted across the AI value chain, this Q2 2025 report is designed to inform investment, product, and policy decisions in an increasingly AI-native world.

- Micah Hill-Smith and George Cameron, Founders of Artificial Analysis

For more details, contact us at founders@artificialanalysis.ai

Contents

1. Industry Overview
Overview of market movements and trends by key players in the AI industry

2. Language Models
Trends in frontier language models, including hybrid models, cost and efficiency improvements

3. Image and Video
Trends in frontier image and video models including an overview of the leading models in Artificial Analysis Image and Video Arenas

4. Speech and Audio
Trends across new speech models and an overview of new and leading models in the Artificial Analysis Speech Arena

5. Accelerators
Overview of the AI accelerator market including market trends, available accelerators and vertical integration by select chipmakers

5 major trends have shaped the State of AI across Q2 2025

• Tools and connections enable smart workflow integration: Native connections and tools in chat interfaces are now shifting workloads to agentic approaches
• Language models continue to become more intelligent: Major AI labs have all continued to make substantial gains in intelligence, cost efficiency and speed
• Coding agents rapidly proliferate across development workflows: Q2 saw 12 major coding agent launches, including from major labs
• Video models see breakthroughs and rapid quality increase: Google Veo 3's release showcases audio-video breakthroughs, driving adoption and new use cases
• China continues to demonstrate leadership in language and video: Models from Chinese AI labs occupy top spots for open weights language models and on the video leaderboard

Players in the AI value chain differ in levels of vertical integration; Google continues to stand out as the most vertically integrated, from TPU accelerators to Gemini

Key Players in the AI Value Chain (Non-Exhaustive)

[Chart: presence of each company at each layer of the value chain, rated on a scale from "No presence" to "Strong presence". Layers: Applications; Foundation models (first party); Cloud Inference (first party); Accelerator Hardware. Companies: Anthropic, Microsoft, DeepSeek, Snowflake, Databricks, SambaNova, Together.ai, Fireworks, DeepInfra, OpenAI, Google, Amazon, Alibaba, Perplexity, Cohere, Cerebras, Nebius, Meta, Mistral, NVIDIA, Groq, AMD, xAI.]

Classifications are indicative and determined based on a range of factors including market share and strength of offering

Source: Company website

Big technology companies are continuing to play across all AI modalities while smaller challengers tend to focus on specific modalities

Key players with first-party models by type of AI

[Chart: for each company, whether it has an existing first-party model or no model in each modality. Modalities: Language; Speech; Image; Video. Companies: Anthropic, Microsoft, OpenAI, Google, Mistral, Amazon, NVIDIA, Meta, xAI, Adobe, ElevenLabs, Perplexity, Alibaba, Bytedance, Tencent, Baidu, DeepSeek, Kuaishou, MiniMax, Cohere, Midjourney, AI21 Labs.]

Source: Company website

01. Industry Overview

A number of AI Labs now have models near the frontier of intelligence; xAI has the leading model with Grok 4, achieving this feat in less than 500 days since their first models' launch

Frontier Large Language Model (LLM) Intelligence, Over Time
Artificial Analysis Intelligence Index v2 (incorporates MMLU-Pro, GPQA Diamond, Humanity's Last Exam, LiveCodeBench, SciCode, AIME 2024, MATH-500)

[Chart labels: o3-pro, Gemini 2.5 Pro, Grok 4, DeepSeek R1 0528, Claude 4 Opus (Extended Thinking), Llama 4 Maverick]

• xAI leads the intelligence frontier for the first time: xAI Grok 4 achieves the highest intelligence score (73) on the Artificial Analysis Index, surpassing OpenAI's o3-pro (71), Google Gemini 2.5 Pro (70), and DeepSeek R1 (68)
• Open-source models reach frontier performance: DeepSeek R1 ranks among the most intelligent models globally, proving open-weights architectures can compete with proprietary solutions
• OpenAI's lead faces challenge: The intelligence frontier is now fiercely contested by multiple AI labs, challenging OpenAI's long-held leadership

Source: Artificial Analysis independent benchmarking

02. Language Models

xAI, OpenAI, and Google lead frontier intelligence with their latest reasoning models, followed closely by other labs

Leading Large Language Models (LLMs), by AI lab
Highest Artificial Analysis Intelligence Index v2 achieved by each AI Lab

NON-EXHAUSTIVE

Commentary
• OpenAI loses frontier for the first time: xAI's Grok 4 is now the most intelligent language model, sitting ahead of o3-pro, OpenAI's frontier model
• xAI, OpenAI and Google lead frontier intelligence: Latest reasoning models from three labs hold the top 5 positions
• Reasoning models continue to dominate: Q2 '25 continues to see reasoning models solidify their position as the clearest path to higher intelligence index scores
• Global competition intensifies: Labs like DeepSeek, MiniMax, and Alibaba continue to close gap

Source: Artificial Analysis independent benchmarking

Models: Over the past year, OpenAI has maintained its lead, Google Gemini and DeepSeek have surged, and Meta Llama and Mistral have fallen

Demand for Top 10 LLM Families in May 2025
Which LLM families are you using or considering using? N=270 (2024) and 591 (2025)

[Chart: share of respondents per LLM family in 2025 vs 2024, with the change between 2024 and 2025 in percentage points. Families shown: Google Gemini, OpenAI (GPT/o), Anthropic Claude, Meta Llama, xAI Grok, Alibaba Qwen, Mistral, DeepSeek, Perplexity Sonar, Microsoft Phi. Excerpt from the Artificial Analysis AI Adoption Survey, H1 2025.]

Source: Artificial Analysis AI Adoption Survey – H1 2025

Open Source: Open weights language models continue to improve; the gap to leading proprietary models stayed similar

Leading Language Models by License Type, Over Time
Artificial Analysis Intelligence Index (incorporates MMLU-Pro, GPQA, Humanity's Last Exam, LiveCodeBench, SciCode, AIME, MATH-500)

[Chart labels: o3-pro, Gemini 2.5 Pro, Grok 4, DeepSeek-R1 0528, DeepSeek R1]

Commentary
• Open weights close the gap to proprietary models: The release of DeepSeek R1 0528 in May further reduced the intelligence gap to leading proprietary models from Google and OpenAI (similar to the release of DeepSeek R1); the release of Grok 4 has since widened this gap
• Proprietary and open weights models continue their rapid release cadence: Q2 '25 continued to see frequent incremental improvements drive the frontier

Source: Artificial Analysis independent benchmarking

Open Source: Leading proprietary models are from US labs, while China leads the open weights intelligence frontier

Leading Language Models by License Type
Artificial Analysis Intelligence Index v2 (incorporates MMLU-Pro, GPQA, Humanity's Last Exam, LiveCodeBench, SciCode, AIME, MATH-500)

NON-EXHAUSTIVE

Commentary
• Proprietary models continue to lead frontier intelligence: Proprietary reasoning models from US labs lead in overall intelligence
• China demonstrates open weights leadership: Leading open weights models are from Chinese AI labs (DeepSeek, MiniMax, Alibaba, Moonshot)
• Proprietary models marginally lead for non-reasoning models: Claude 4 Opus is currently the most intelligent non-reasoning model, followed closely by Kimi K2

Source: Artificial Analysis independent benchmarking

Country view: Models from labs in the US and China continue to dominate the intelligence frontier

Leading Language Models by Country of Origin
Artificial Analysis Intelligence Index v2 (incorporates MMLU-Pro, GPQA, Humanity's Last Exam, LiveCodeBench, SciCode, AIME, MATH-500)

Commentary
• US maintains leadership in frontier reasoning: US-based labs continue to hold the top spots on the Intelligence Index with their premier reasoning models like Grok 4, o3-pro and Gemini 2.5 Pro
• Q2 saw limited disruption from other countries. France maintains a presence with Magistral Medium, while Upstage AI's Solar Pro 2 model brought South Korea to the frontier for the first time
• Overall, the global frontier remains highly concentrated, with the US and China continuing to define the pace and direction of cutting-edge model development

Source: Artificial Analysis independent benchmarking

While efficiency gains have been made…
GPT-4 level intelligence is now 100x cheaper than original GPT-4

A. Smaller Models: Algorithmic and training data improvements have allowed smaller models to get smarter
B. Software Efficiency: Inference optimizations (e.g. FlashAttention) improve efficiency
C. Hardware Efficiency: Next generation accelerators offer more compute efficiency
[Indicative multipliers shown for factors A to C: ~1/3x compute, ~1/10x compute, ~1/3x costs]

…compute demand continues to increase
New applications continue to demand more compute: a single deep research query can cost >10x an original GPT-4 query

D. Larger Models (~5x compute/query): Scaling laws continue to demand higher parameter counts for greater intelligence
E. Reasoning Models (~10x tokens/query): Significant increase in output tokens when models 'think' before answering
F. AI Agents (~20x requests/use): Agents chain multiple requests to LLMs to complete tasks autonomously

Figures are highly indicative and serve to illustrate the directional impact of each factor impacting cost

Deep dives on selected factors follow.

B. Software Efficiency: Efficient models combined with new accelerators kept slashing AI inference costs throughout Q2

Language Model Inference Pricing by Intelligence Class, Over Time
Price in USD per 1 million tokens (blended input to output token price 3:1); Artificial Analysis Intelligence Index v2 (incorporates 7 evaluations)

NON-EXHAUSTIVE

[Chart labels: GPT-4, GPT-4o, GPT-3.5 Turbo, o1-mini, DeepSeek R1 Distill Llama 8B, Gemini 2.0 Flash Lite, DeepSeek-R1 0528, Qwen3 8B, Gemma 3n E4B Instruct]

Commentary
• Q2 2025 accelerates the slide in inference cost: from April to June, prices fell across every intelligence band as DeepSeek R1 0528, Qwen3 8B, and Gemma 3n E4B Instruct slashed costs while lifting scores
• Capable AI is becoming more accessible and commoditized: during Q2 2025, the price of frontier-level inference (Intelligence Index ≥50) dropped by nearly 75%, sliding from $0.26 to just $0.063 per million tokens

Source: Artificial Analysis independent benchmarking
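The two quantities on this slide can be reproduced with simple arithmetic. The sketch below assumes that a "3:1 blended" price means a weighted average of three parts input price to one part output price; the function names and the example input and output prices are illustrative and not taken from the report. Only the $0.26 and $0.063 endpoints come from the commentary.

```python
def blended_price(input_price: float, output_price: float,
                  input_weight: float = 3.0, output_weight: float = 1.0) -> float:
    """Weighted-average price per 1M tokens (assumed reading of '3:1 blended')."""
    total_weight = input_weight + output_weight
    return (input_weight * input_price + output_weight * output_price) / total_weight


def percent_drop(old_price: float, new_price: float) -> float:
    """Relative price decline, in percent."""
    return (old_price - new_price) / old_price * 100.0


# Hypothetical model priced at $1.00 per 1M input tokens and $3.00 per 1M output tokens.
example_blended = blended_price(1.00, 3.00)

# Endpoints quoted in the commentary for frontier-level inference during Q2 2025.
frontier_drop = percent_drop(0.26, 0.063)

print(f"blended price: ${example_blended:.2f} per 1M tokens")
print(f"frontier-level price drop: {frontier_drop:.1f}%")
```

With the rounded endpoints quoted in the commentary, the drop works out to roughly 76%, consistent with the order of magnitude stated on the slide.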

B. Software Efficiency: Throughput significantly increased in Q2 2025 across model classes, but end-user wait times are sometimes growing due to long reasoning chains

Language Model Output Speed by Intelligence, Over Time
Total output tokens per second, Artificial Analysis Intelligence Index v2 (incorporates 7 leading evaluations)

NON-EXHAUSTIVE

[Chart labels: Gemini 2.5 Flash-Lite (Reasoning), Gemini 2.5 Flash-Lite, Nova Micro, Gemini 1.5 Flash-8B]

Commentary
• A Q2 2025 speed surge overcame the trade-off against intelligence: a significant leap in inference performance occurred in the second quarter of 2025, and new releases made highly intelligent models (Index >=50) the fastest category for the first time
• Latency paradox: despite higher throughput, end-to-end use can be slower as reasoning and agentic tasks generate tens of thousands of tokens and chain multiple calls, fully offsetting speed gains

Source: Artificial Analysis independent benchmarking

E. Reasoning Models: Reasoning costs time and compute: reasoning models use up to 10x more tokens to respond to the same prompts as non-reasoning models

Output Tokens Used to Run Artificial Analysis Intelligence Index
Artificial Analysis Intelligence Index v2 (incorporates 7 leading evaluations), Output Tokens Used in Artificial Analysis Intelligence Index (~5M input tokens)

NON-EXHAUSTIVE

• ~78M [1]: Avg. total output tokens for reasoning models
• ~10M [1]: Avg. total output tokens for non-reasoning models

1. Based on representative models included in the chart
Source: Artificial Analysis independent benchmarking
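To make the averages concrete, the sketch below computes the ratio between the two token totals and what that ratio implies for output-token spend. The per-million-token price is a hypothetical placeholder chosen only to illustrate the scaling; it is not a figure from the report, and real reasoning and non-reasoning models are usually priced differently.

```python
# Average output tokens used to run the Intelligence Index (approximate figures from the slide).
REASONING_OUTPUT_TOKENS = 78_000_000
NON_REASONING_OUTPUT_TOKENS = 10_000_000


def token_ratio(reasoning_tokens: int, non_reasoning_tokens: int) -> float:
    """How many times more output tokens the reasoning models used on average."""
    return reasoning_tokens / non_reasoning_tokens


def output_cost_usd(tokens: int, price_per_million_usd: float) -> float:
    """Output-token cost at a flat per-million-token price."""
    return tokens / 1_000_000 * price_per_million_usd


HYPOTHETICAL_PRICE = 10.0  # USD per 1M output tokens, illustrative only

ratio = token_ratio(REASONING_OUTPUT_TOKENS, NON_REASONING_OUTPUT_TOKENS)
reasoning_cost = output_cost_usd(REASONING_OUTPUT_TOKENS, HYPOTHETICAL_PRICE)
non_reasoning_cost = output_cost_usd(NON_REASONING_OUTPUT_TOKENS, HYPOTHETICAL_PRICE)

print(f"average ratio: {ratio:.1f}x")
print(f"cost at ${HYPOTHETICAL_PRICE}/1M: ${reasoning_cost:.0f} vs ${non_reasoning_cost:.0f}")
```

The averages give a ratio of about 7.8x, which sits below the "up to 10x" figure in the slide title because the title describes the upper end rather than the mean.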

In Q2 2025 we saw increased use of agentic workflows and explosive growth in coding agents, both enabled by a connection ecosystem and new model training approaches

Key Themes in Q2 '25

Applications move towards 'agentic by default'
• Agentic workflows continue to become embedded in a wide range of AI applications that previously used linear execution and minimal tool use, such as chatbots, terminals, and data analysis tools
• Deep research agents became table stakes for major chatbots and some smaller Chinese lab entrants

Ecosystem of connections continues to grow and enable new functionality
• Applications such as ChatGPT and Claude expanded their suite of integrations, both with internally developed tools and increasing Model Context Protocol (MCP) compatibility
• First and third party MCP servers proliferated across a range of APIs and businesses, with strong usage in developer and consumer applications

Coding agents see rapid growth
• Q2 saw an unprecedented volume of new coding agent products, with 12 major coding agent launches within the quarter, including frontier lab products like OpenAI Codex and Gemini CLI
• Coding agent usage has rapidly grown; approximately half of the respondents to the Artificial Analysis AI Adoption Survey use or are considering using Cursor

Agentic model use will drive up LLM usage costs
• Agents incur additional usage of tokens and tools, driving increased costs; deep research APIs launched in Q2 have demonstrated costs of up to $28 for a single complex query in testing

Training focuses on agents and long-horizon tool use
• Reasoning models and reinforcement learning have enabled more effective tool use, including tool calls interleaved with model thinking before producing responses to users
• Model creators increased the emphasis in training on long-running tasks and agentic workflows for models such as the Claude 4 family and Kimi K2

Agents are autonomous systems driven by LLMs…

What are agents?
• "Systems where LLMs dynamically direct their own processes and tool usage, maintaining control over how they accomplish tasks"
• "Agents represent systems that intelligently accomplish tasks, ranging from executing simple workflows to pursuing complex, open-ended objectives"
• "AI agents are autonomous systems powered by large language models (LLMs) that, given high-level instructions, can plan, use tools, carry out steps of processing, and take actions to achieve specific goals"

AI agents are LLM-driven systems that act autonomously and use tools to complete tasks end-to-end

1. Can include direct vision capability or processing of other data such as website HTML to identify actions
Source: Company website; Anthropic 'Building effective agents'

Agent workflow definition

Fundamentally, agents in every domain run in a loop and take actions by using tools, such as searching the web or writing to a file

[Diagram: User → Agent → Complete task]
• User: Users make initial requests, and the agent may engage them in further turns where needed (e.g., to clarify)
• Agent: Large Language Model plus a toolset and environment; the agent decides when the task is 'complete'
• Toolset and environment (example tool inclusions): File system access; API integrations; Code execution environments; MCP servers
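The loop described in the workflow definition can be sketched in a few lines. This is a minimal illustration under stated assumptions, not any vendor's implementation: `scripted_model` is a hypothetical stand-in for an LLM call, the `Decision` type and both tools are invented for the example, and the tools return canned strings instead of touching the network or file system.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Optional


@dataclass
class Decision:
    """What the model wants to do next: call a tool, or finish with an answer."""
    tool: Optional[str] = None
    argument: str = ""
    answer: Optional[str] = None  # set once the agent considers the task complete


def scripted_model(history: List[str]) -> Decision:
    """Hypothetical stand-in for an LLM: search first, then write a file, then stop."""
    if not any(entry.startswith("tool:search") for entry in history):
        return Decision(tool="search", argument=history[0])
    if not any(entry.startswith("tool:write_file") for entry in history):
        return Decision(tool="write_file", argument="notes.txt")
    return Decision(answer=f"done after {len(history) - 1} tool calls")


def run_agent(request: str,
              model: Callable[[List[str]], Decision],
              tools: Dict[str, Callable[[str], str]],
              max_steps: int = 10) -> str:
    """Run the loop: ask the model, execute the chosen tool, feed the result back."""
    history = [request]
    for _ in range(max_steps):
        decision = model(history)
        if decision.answer is not None:  # the agent decides when the task is complete
            return decision.answer
        result = tools[decision.tool](decision.argument)
        history.append(f"tool:{decision.tool} -> {result}")
    raise RuntimeError("step budget exhausted before the task was completed")


# Illustrative tools that only return strings.
TOOLS: Dict[str, Callable[[str], str]] = {
    "search": lambda query: f"results for {query!r}",
    "write_file": lambda path: f"wrote {path}",
}

print(run_agent("summarize Q2 coding agent launches", scripted_model, TOOLS))
```

The `max_steps` budget is a design choice worth keeping in any real loop: because the agent itself decides when it is finished, an explicit cap bounds the token and tool cost of a run that never converges.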

Built on the improving intelligence of language models, AI agents offer key benefits compared to traditional workflows and are seeing success in several domains

Overview of agent benefits

Agentic approaches enable new AI-based applications due to a range of key benefits compared to static workflows:
1. Dynamic planning, task tracking, and execution for complex unknown task requirements to pursue well-defined goals
2. Integration with a wide range of systems and processes across a domain without a clear sequence of dependencies or 'chains' of use to complete tasks
3. Natural collaboration to complete tasks, including engaging human users in the loop to clarify or continue tasks, or coordinating with other agents with additional capabilities
4. Graceful error recovery from feedback where errors occur, even with unique or unexpected failure modes

Key domains showing early successes
• Coding: Finds, interprets, edits and tests source code to complete software engineering tasks
• Deep research: Parses (and potentially clarifies) a research query and launches a chain of targeted research queries while controlling its research flow to synthesize an answer
• Computer use: Interprets user commands, 'looks' at a desktop or browser window [1], and autonomously chains clicks, keystrokes, shell commands, and API calls to complete arbitrary tasks
• Customer support: Customer agent in live speech or text chat which identifies intent and responds to customers in real time, while chaining required app, CRM or API calls to complete the task (or hand off to a human agent)
• Sales: Identifies potential leads, executes personalized outreach, and integrates with sales tools

For a deep dive on the latest progress in AI agents, see the Artificial Analysis Q2 Agents and Applications Report

Several competing players are emerging in the big agent domains in 2025; leading labs are focused on coding, research, and computer use

NON-EXHAUSTIVE

Domain: illustrative products & players (frontier lab products are marked in the original chart)
A. Coding: GitHub Copilot, Replit
B. Deep research: Mistral AI
C. Computer use: Adept, Comet, Browser Use
D. Customer support: Fin, Freshworks, Decagon
E. Sales: 11x, Persana AI, AiSDR, Relevance AI

These domains are showing the most progress in commercial off-the-shelf products and research previews, while other use cases are following.

In parallel, a range of providers are enabling users to build custom AI agents for their use cases: Relevance AI, Stack AI

A. AI Coding Tools: GitHub Copilot and Cursor dominate the market as the most popular AI coding tools, with a significant lead over Claude Code and Gemini Code Assist

Demand for Coding Tools
Which AI tools are you using or considering using this year? N=955

Excerpt from Artificial Analysis AI Adoption Survey Report H1 2025

Source: Artificial Analysis AI Adoption Survey – H1 2025

A. Coding agents: Coding agent launches have accelerated in 2025, with a large focus on command-line interfaces

NON-EXHAUSTIVE

Major coding agent product launches by quarter
Number of coding agent launches identified, based on primary form factor [1]

[Chart: launches per quarter from 2023-Q1 to 2025-Q2, split by primary form factor: IDE extension; Cloud coding agent; Dedicated IDE; App builder; Local non-IDE (incl. CLIs). The 2025-Q2 total is 12.]

Notable releases in Q2 included:
• GitHub Copilot Coding Agent
• Cursor Background Agents
• OpenAI Codex CLI and cloud
• Google Gemini CLI and Jules

In July 2025, Amazon launched Kiro, an AI IDE in public preview

Releases include: Replit, Windsurf, Augment Code, Roo Code, OpenAI Codex, Background agents, GitHub Copilot

1. Primary form factor is assessed qualitatively when multiple modes apply
Note: Release timings are best estimates, based on general or public availability where possible. Where relevant, timing is based on when AI coding agents became a core product capability

03. Image and Video Models

Q2 '25 saw a shift in progress to video models, with audio support and breakthroughs in quality, while open weights model progress slowed in both image and video

Key Themes in Q2 '25

Video models begin to support audio
• Veo 3, released in May 2025, became the first high-quality, mainstream model to natively support audio generation as part of a video model, driving strong adoption
• Veo 3's differentiated audio support gives it strong pricing power at $0.75/s of 720p video with audio, surpassing comparable models such as Seedance 1.0 at ~$0.13/s of 1080p video, and Hailuo 2 at ~$0.08/s of 1080p video

Video models see continued breakthroughs in quality
• Video models see a breakthrough in quality, with Seedance 1.0 overtaking both Q1 leaders: Veo 2 in text to video by ~150 ELO points, and Kling 1.6 Pro in image to video by ~200 ELO points
• Labs shift focus to image to video generation, with a larger ELO jump than text to video, and models such as Midjourney V1 and Kling 2.1 Pro available only in image to video variants
• Open weights video models lag behind proprietary alternatives, with Alibaba Wan 2.1 still representing the SOTA for open weights text to video generation and LTX Video v0.9.7 13B ranking 16th overall for image to video on the Artificial Analysis leaderboard
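An ELO gap is easier to interpret as an expected head-to-head preference rate. The sketch below assumes the standard Elo logistic formula with a 400-point scale; the report does not state how the Artificial Analysis arenas compute ratings, so treat the mapping as an approximation rather than the leaderboard's own method. The function name is illustrative.

```python
def expected_win_rate(elo_gap: float, scale: float = 400.0) -> float:
    """Probability that the higher-rated model is preferred, under the standard Elo model."""
    return 1.0 / (1.0 + 10.0 ** (-elo_gap / scale))


# Gaps quoted in the themes above.
text_to_video_gap = 150.0   # Seedance 1.0 over Veo 2
image_to_video_gap = 200.0  # Seedance 1.0 over Kling 1.6 Pro

print(f"~150 point gap: {expected_win_rate(text_to_video_gap):.1%} expected preference rate")
print(f"~200 point gap: {expected_win_rate(image_to_video_gap):.1%} expected preference rate")
```

Under this assumption, a gap of about 150 points corresponds to the newer model being preferred in roughly 70% of pairwise comparisons, and a gap of about 200 points to roughly 76%.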

Image editing models launched
• Instruction based image editing models become popular, with GPT-4o continuing to hold the lead, but FLUX.1 Kontext [max] and HiDream-E1.1 launched as competitive models in Q2
• Open weights ima
