Ein neuer Benchmark aus China testet KI-Modelle auf ihre Fähigkeit, reale Aufgaben zu lösen. Er soll Unternehmen bei Investitionsentscheidungen in KI helfen.

https://www.heise.de/news/Xbench-Chinesischer-KI-Benchmark-prueft-Modelle-auf-Alltagstauglichkeit-10460761.html?wt_mc=sm.red.ho.mastodon.mastodon.md_beitraege.md_beitraege&utm_source=mastodon

heise online · Jun 26Xbench: Chinesischer KI-Benchmark prüft Modelle auf AlltagstauglichkeitBy Caiwei Chen

#Benchmark #IT #KünstlicheIntelligenz

**c't Magazin** @ct_Magazin@social.heise.de · Jun 17

Jun 17

c't Magazin @ct_Magazin@social.heise.de

heise+ | Wie c't Grafikkarten testet: Spiele-Benchmarks, Lautstärke, Leistungsaufnahme

Rechenleistung, Speichermenge, Displaytechnik sowie die Lautheit des Kühlers sind Kenngrößen für Grafikkarten. Wir stellen unser aktuelles Testverfahren vor.

https://www.heise.de/hintergrund/Wie-c-t-Grafikkarten-testet-Spiele-Benchmarks-Lautstaerke-Leistungsaufnahme-10439958.html?wt_mc=sm.red.ho.mastodon.mastodon.md_beitraege.md_beitraege&utm_source=mastodon

c't MagazinWie c't Grafikkarten testet: Spiele-Benchmarks, Lautstärke, LeistungsaufnahmeRechenleistung, Speichermenge, Displaytechnik sowie die Lautheit des Kühlers sind Kenngrößen für Grafikkarten. Wir stellen unser aktuelles Testverfahren vor.

#Benchmark #Entertainment #Spiele

**James House-Lantto (He/Him)** @Theeo123@mastodon.social · Jun 13 *

Jun 13 *

James House-Lantto (He/Him) @Theeo123@mastodon.social

https://www.gamingonlinux.com/2025/06/3dmark-are-planning-a-linux-version-but-no-date-for-it-yet/

3D Mark will be releasing a Linux version of their benchmarking tool, which is good news, no info on exactly when however.

GamingOnLinux · Jun 133DMark are planning a Linux version but no date for it yetBy Liam Dawe

#3Dmark #Benchmark #Linux

**heise online English** @heiseonlineenglish@social.heise.de · Jun 11

Jun 11

heise online English @heiseonlineenglish@social.heise.de

Nvidia's PC processor N1X in Geekbench

Initial entries in a benchmark database attest to Nvidia's upcoming N1X 20 CPU cores and more than 4 GHz clock frequency.

https://www.heise.de/en/news/Nvidia-s-PC-processor-N1X-in-Geekbench-10440781.html?wt_mc=sm.red.ho.mastodon.mastodon.md_beitraege.md_beitraege&utm_source=mastodon

heise online · Jun 11Nvidia's PC processor N1X in GeekbenchBy Mark Mantel

#ARM #Benchmark #Prozessoren

**heise online** @heiseonline@social.heise.de · Jun 11

Jun 11

heise online @heiseonline@social.heise.de

Nvidias PC-Prozessor N1X im Geekbench

Erste Einträge in einer Benchmark-Datenbank attestieren Nvidias kommendem N1X 20 CPU-Kerne und mehr als 4 GHz Taktfrequenz.

https://www.heise.de/news/Nvidias-PC-Prozessor-N1X-im-Geekbench-10440734.html?wt_mc=sm.red.ho.mastodon.mastodon.md_beitraege.md_beitraege&utm_source=mastodon

heise online · Jun 11Nvidias PC-Prozessor N1X im GeekbenchBy Mark Mantel

#ARM #Benchmark #Prozessoren

**Piotr Migdał** @pmigdal@mathstodon.xyz · May 16

May 16

Piotr Migdał @pmigdal@mathstodon.xyz

Which AI models are best across 28 benchmarks?
Turns out, Gemini 2.5 Pro from Google rocks!

This chart shows Elo ratings for "would model A beat model B in a benchmark".

Data by @scaling01, I created this chart with #QuesmaCharts.

#AI #LLM #benchmark

**heise online** @heiseonline@social.heise.de · May 14

May 14

heise online @heiseonline@social.heise.de

KI-Update: KI im Gesundheitswesen, Apple Intelligence, TikTok, Papst Leo zu KI

Das "KI-Update" liefert werktäglich eine Zusammenfassung der wichtigsten KI-Entwicklungen.

https://www.heise.de/news/KI-Update-KI-im-Gesundheitswesen-Apple-Intelligence-TikTok-Papst-Leo-zu-KI-10382777.html?wt_mc=sm.red.ho.mastodon.mastodon.md_beitraege.md_beitraege&utm_source=mastodon

heise online · May 14KI-Update: KI im Gesundheitswesen, Apple Intelligence, TikTok, Papst Leo zu KIBy Marko Pauli

#AppleIntelligence #Benchmark #DigitalHealth

**c't Magazin** @ct_Magazin@social.heise.de · Apr 28

Apr 28

c't Magazin @ct_Magazin@social.heise.de

heise+ | Duell in der Mittelklasse-CPUs: Intel Core Ultra 200S gegen Ryzen 9000 im Test

Bei den Arrow-Lake-CPUs hat Intel nicht nur ein neues Namensschema eingeführt. Wir testen, wie gut sich die günstigen 65-Watt-Modelle gegen Ryzen 9000 schlagen.

https://www.heise.de/tests/Duell-in-der-Mittelklasse-CPUs-Intel-Core-Ultra-200S-gegen-Ryzen-9000-im-Test-10313447.html?wt_mc=sm.red.ho.mastodon.mastodon.md_beitraege.md_beitraege&utm_source=mastodon

c't MagazinDuell in der Mittelklasse-CPUs: Intel Core Ultra 200S gegen Ryzen 9000 im TestBei den Arrow-Lake-CPUs hat Intel nicht nur ein neues Namensschema eingeführt. Wir testen, wie gut sich die günstigen 65-Watt-Modelle gegen Ryzen 9000 schlagen.

#AMD #AMDRyzen #Benchmark

**LinuxNews.de** @linuxnews@social.anoxinon.de · Apr 23

Apr 23

LinuxNews.de @linuxnews@social.anoxinon.de

OCCT Diagnosetool jetzt für Linux verfügbar
https://linuxnews.de/occt-diagnosetool-jetzt-fuer-linux-verfuegbar/ #monitoring #stresstest #benchmark

LinuxNews.de · Apr 23OCCT Diagnosetool jetzt für Linux verfügbar

More from

LinuxNews.de

**it's B! Cavello** @b_cavello@mastodon.publicinterest.town · Apr 21

Apr 21

it's B! Cavello @b_cavello@mastodon.publicinterest.town

Has anyone created an ML #benchmark for generated code #accessibility yet?

**Linux** @Linux@linuxrocks.online · Apr 14

Apr 14

Linux @Linux@linuxrocks.online

Linux providing a better gaming performance than Microsoft Windows is no longer of any kind of anomaly

AMD Radeon RX 9070 XT / Linux kernel 6.14 / Mesa 25 benchmarked on Arch Linux (Steam OS bases on BTW) vs. Windows 11.

https://youtu.be/KY7E-pj7UYc

youtu.be- YouTubeEnjoy the videos and music you love, upload original content, and share it all with friends, family, and the world on YouTube.

#Linux #vs #Windows11

**Olly** @Olly42@nerdculture.de · Apr 12

Apr 12

Olly @Olly42@nerdculture.de

People are using Super Mario to benchmark AI now.

Hao AI Lab, a research org at the University of California San Diego, threw AI into live Super Mario Bros. games. Anthropic’s Claude 3.7 performed the best, followed by Claude 3.5. Google’s Gemini 1.5 Pro and OpenAI’s GPT-4o struggled.

https://github.com/lmgame-org/GamingAgent

#gamingagent #supermario #llm

**James House-Lantto (He/Him)** @Theeo123@mastodon.social · Apr 8

Apr 8

James House-Lantto (He/Him) @Theeo123@mastodon.social

https://www.theverge.com/meta/645012/meta-llama-4-maverick-benchmarks-gaming

Meta gets caught cheating at AI benchmarks.

Short version, they submitted a different/tweaked version of their new Llama 4 models to benchmarking sites, than what they actually make available to the public.

The Verge · Apr 8Meta got caught gaming AI benchmarksBy Kylie Robison

#Meta #Facebook #Llama

**CEOTECH.IT** @ceotech@mastodon.social · Mar 25

**IT News** @itnewsbot@schleuss.online · Mar 22

Mar 22

IT News @itnewsbot@schleuss.online

The Fastest MS-DOS Gaming PC Ever - After [Andy]’s discovery of an old ISA soundcard at his parents’ place that once w... - https://hackaday.com/2025/03/22/the-fastest-ms-dos-gaming-pc-ever/ #retrocomputing #benchmark #isacards #ms-dos

Hackaday · Mar 22The Fastest MS-DOS Gaming PC EverAfter [Andy]’s discovery of an old ISA soundcard at his parents’ place that once was inside the family PC, the onset of a wave of nostalgia for those old-school sounds drove him off the…

**Nicolas MOUART** @silentexception@mastodon.social · Mar 20

Mar 20

Nicolas MOUART @silentexception@mastodon.social

This is an interesting benchmark.

o1, the superduper model only completes 84.2% of the tasks in the test, 99.2% in correct format.. Qwen2.5-Coder-32B, a relatively small model which can run locally obtains 72.9% and 100.0%.
Yet another proof that LLM/transformers are not great : if a LLM cannot format the code correctly, and/or correct it itself to mark 100%, what is the use in this? Obviously, well, a new way to make money (tokens)..
https://aider.chat/docs/leaderboards/edit.html

#genAI #benchmark #coding

Drag & drop to upload

Recent searches

Search options

Administered by:

Server stats:

#benchmark