overview for morto

BBC gains rare access to the Congolese mine powering mobile phones in c/technology@lemmy.world

[–] morto@piefed.social 6 points 1 day ago

An archived version, for anyone having access difficulties:
https://web.archive.org/web/20250713020638/https://www.bbc.com/news/articles/cyvj986l615o

AI agents wrong ~70% of time: Carnegie Mellon study in c/technology@lemmy.world

[–] morto@piefed.social 5 points 1 week ago (4 children)

and doesn't need to be exactly right

What kind of tasks do you consider that don't need to be exactly right?

Are there any tools I can use for translating a ~400 pages scanned book? in c/asklemmy@lemmy.world

[–] morto@piefed.social 3 points 1 week ago

I'm not sure it would be viable for a long book, and I'm also avoiding Google, but thanks for helping. I got some nice suggestions in this thread.

Are there any tools I can use for translating a ~400 pages scanned book? in c/asklemmy@lemmy.world

[–] morto@piefed.social 3 points 1 week ago

Well, I'm avoiding Google, but I will keep it in mind as an absolute last resort, thanks.

Are there any tools I can use for translating a ~400 pages scanned book? in c/asklemmy@lemmy.world

[–] morto@piefed.social 3 points 1 week ago

I'm giving preference to open source tools, but that's a good thing to know, thanks

Are there any tools I can use for translating a ~400 pages scanned book? in c/asklemmy@lemmy.world

[–] morto@piefed.social 3 points 1 week ago

Thanks for the suggestions. That OCR_translate looks interesting. I will prioritize other recommended tools that seem to be more focused on books, but I bookmarked it for future needs.

Are there any tools I can use for translating a ~400 pages scanned book? in c/asklemmy@lemmy.world

[–] morto@piefed.social 3 points 1 week ago (1 children)

I used Tesseract, but the output PDF didn't have visible text, and I found no way to change that. Maybe I don't know how to use it properly, or it's not intended to keep formatting.
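For context, Tesseract's `pdf` output is a searchable PDF: the recognized text sits in an invisible layer on top of the page image, so not seeing it is expected. A minimal sketch of asking for plain text instead, assuming the Tesseract CLI and its `chi_sim` (Simplified Chinese) language data are installed; the file names and the helper function are hypothetical:

```python
import shlex
from pathlib import Path


def tesseract_text_command(image: Path, out_dir: Path, lang: str = "chi_sim") -> list[str]:
    """Build a Tesseract CLI call that writes plain text instead of a PDF."""
    # Tesseract appends the file extension itself, so the output base has none.
    output_base = out_dir / image.stem
    # The trailing "txt" selects plain-text output; "pdf" would produce the
    # searchable PDF whose text layer is hidden behind the page image.
    return ["tesseract", str(image), str(output_base), "-l", lang, "txt"]


if __name__ == "__main__":
    # Hypothetical paths, for illustration only.
    cmd = tesseract_text_command(Path("scans/page_001.png"), Path("ocr"))
    print(shlex.join(cmd))
    # To actually run it (needs Tesseract installed):
    # import subprocess; subprocess.run(cmd, check=True)
```

The plain-text file loses the page layout, but it can be fed straight into a translation tool, which is probably the simpler route if formatting is not essential.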

Are there any tools I can use for translating a ~400 pages scanned book? in c/asklemmy@lemmy.world

[–] morto@piefed.social 4 points 1 week ago (1 children)

That PaddleOCR looks very interesting. It will even extract images and formulas and somewhat preserve formatting in the output! I will try this one, even if it takes more than a day to process with my low-end CPU. Thank you for the suggestion!

40

Are there any tools I can use for translating a ~400 pages scanned book? (piefed.social)

submitted 1 week ago* (last edited 1 week ago) by morto@piefed.social to c/asklemmy@lemmy.world

19 comments

Situation: I have a scanned book in Chinese that has no available translation. I really want to read it, because it would probably help a lot with my university project.

What I tried: I tried creating a version with OCR to get a text layer and then running a translation tool on it, but found no way to make the OCR text visible. I also tried this tool, but the OCR didn't work for me, and I found no way to use it with a local model.

Have any of you ever done a similar task? I'd appreciate any kind of suggestions and tips.