this post was submitted on 02 Oct 2025
105 points (91.3% liked)

Technology

77096 readers
2838 users here now

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related news or articles.
  3. Be excellent to each other!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, this includes using AI responses and summaries. To ask if your bot can be added please contact a mod.
  9. Check for duplicates before posting, duplicates may be removed
  10. Accounts 7 days and younger will have their posts automatically removed.

Approved Bots


founded 2 years ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
[–] InEnduringGrowStrong@sh.itjust.works 101 points 2 months ago (17 children)

Microsoft says its Agent Mode in Excel has an accuracy rate of 57.2 percent in SpreadsheetBench, a benchmark for evaluating an AI model’s ability to edit real world spreadsheets.

It generates 42.8% bullshit.

[–] jubilationtcornpone@sh.itjust.works 43 points 2 months ago (9 children)

They probably view that as a statistic worth bragging about. It's not. If Excel got calculations right 57.2% of the time it would be completely worthless.

[–] PerogiBoi@lemmy.ca 3 points 2 months ago (1 children)

I asked copilot to look through my every spreadsheet and find how many instances of a category occurred. I was curious to see if it was any good. Gave me 2 different numbers. Neither were correct.

[–] jubilationtcornpone@sh.itjust.works 4 points 2 months ago (2 children)

Copilot: Putting the "Artificial" in Artificial Intelligence.

[–] sirboozebum@lemmy.world 2 points 2 months ago

Fartificial Intelligence

[–] PerogiBoi@lemmy.ca 2 points 2 months ago

The tech behind LLMs could have just been Clippy and everyone would be happy.

load more comments (7 replies)
load more comments (14 replies)