this post was submitted on 07 Jan 2026
346 points (98.6% liked)

Technology

78543 readers
3128 users here now

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related news or articles.
  3. Be excellent to each other!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, this includes using AI responses and summaries. To ask if your bot can be added please contact a mod.
  9. Check for duplicates before posting, duplicates may be removed
  10. Accounts 7 days and younger will have their posts automatically removed.

Approved Bots


founded 2 years ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
[โ€“] Earthman_Jim@lemmy.zip 1 points 3 days ago (1 children)

Is it? What's so fucking hard about it?

[โ€“] howrar@lemmy.ca 1 points 3 days ago

The main difficulty is in how many hyperparameters are involved in training an RL agent, high sensitivity of RL algorithms to those hyperparameters, and not having a good understanding of how to select them based on the properties of your task. This problem is exacerbated by the high sample complexity of RL. If something doesn't work out, you don't know if it's because you chose the wrong set of hyperparameters or if you just haven't trained for long enough.

I don't know much about game design, but I do know that it's a much more mature field than RL, so surely they have better tools than guessing and praying.