lemmy.net.au


This instance is hosted in Sydney, Australia and maintained by Australian administrators.

Feel free to create and/or join communities for any topics that interest you!

Rules are very simple

Mobile apps

https://join-lemmy.org/apps

What is Lemmy?

Lemmy is a self-hosted social link aggregation and discussion platform. It is completely free and open, and not controlled by any company. This means that there is no advertising, tracking, or secret algorithms. Content is organized into communities, so it is easy to subscribe to topics that you are interested in, and ignore others. Voting is used to bring the most interesting items to the top.

Think of it as an open-source alternative to Reddit!

founded 11 months ago

A century ago, Estonia, Latvia, and Lithuania fought to free themselves from Russian rule, only to be conquered again by the Soviet Union during World War II. When the USSR collapsed in 1991, they emerged devastated but determined to rebuild on Western terms. Over the next three decades, they transformed from poor post-Soviet republics into some of Europe’s fastest-growing economies, embracing democracy, digitalization, and NATO protection. But after Russia’s invasion of Ukraine, the old fear returned. Now, the Baltic States are pouring billions into defense, fortifying borders, and training citizens to fight. Having risen from oppression to prosperity once before, they know exactly what they stand to lose.


Just to keep my fellow lemmygradians updated on what AI tools are capable of, and also because I'm pretty stoked for this project.

I put $5 into the DeepSeek API (sidenote: I like that you have to top up a credit balance and they don't auto-bill), then downloaded Crush. Crush is an agentic coding tool, meaning it basically instructs the LLM to do stuff automatically.

It made me a complete Python script to first download all of the ProleWiki content pages into txt files (which also means we can do backups, even if it's a little hacked together).
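For anyone wondering what a download script like that might look like, here is a minimal sketch. It is not the script Crush wrote (that one isn't shown here); it assumes the wiki exposes the standard MediaWiki Action API, and `API_URL`, `safe_filename` and `download_all` are hypothetical names picked for illustration.

```python
import json
import re
import urllib.parse
import urllib.request
from pathlib import Path

# Hypothetical endpoint: replace with the wiki's real api.php URL.
API_URL = "https://example.org/api.php"


def api_get(params):
    """Call the MediaWiki Action API and return the decoded JSON."""
    query = urllib.parse.urlencode({**params, "format": "json", "formatversion": "2"})
    with urllib.request.urlopen(f"{API_URL}?{query}") as resp:
        return json.load(resp)


def iter_titles():
    """Yield every page title in the main namespace, following continuation."""
    params = {"action": "query", "list": "allpages", "aplimit": "max"}
    while True:
        data = api_get(params)
        for page in data["query"]["allpages"]:
            yield page["title"]
        if "continue" not in data:
            break
        params.update(data["continue"])


def safe_filename(title):
    """Replace characters that are awkward in file names and add .txt."""
    return re.sub(r'[\\/:*?"<>|]+', "_", title).strip() + ".txt"


def download_all(out_dir):
    """Save the current wikitext of every page as one txt file per page."""
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)
    for title in iter_titles():
        data = api_get({
            "action": "query",
            "prop": "revisions",
            "rvprop": "content",
            "rvslots": "main",
            "titles": title,
        })
        text = data["query"]["pages"][0]["revisions"][0]["slots"]["main"]["content"]
        (out / safe_filename(title)).write_text(text, encoding="utf-8")
```

Calling `download_all("pages_en")` would fetch one page per request, which is slow but simple. A real script would probably also want a custom User-Agent, retries, and a pause between requests.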

Then with a second script we are running these txts through a (local) LLM to translate them for our French instance. The problem is there are 5000 pages on the EN instance and a grand total of 3 in French, so nobody is interested in joining and writing pages from scratch when you could "just" find them on the EN instance.

For these two scripts (which are running right now) I've paid a whopping 67 cents in API costs. It amounts to a few hours of prompting and then of course waiting for the agent to work.

Cache hits on DeepSeek are a godsend for agentic work as they're basically free (less than 2 cents per 1M tokens), and with a codebase you constantly feed it the same code over and over. This is why my cache hit rate is so high.

Compare that to GPT-5, which costs 12 cents per 1M cache-hit tokens.
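To put the two cache-hit prices side by side, here is a tiny back-of-the-envelope calculation. The prices are the ones quoted above (2 cents is the upper bound for DeepSeek); the 10M-token volume is a made-up figure, not my actual usage.

```python
def cache_hit_cost_cents(cached_tokens, cents_per_million):
    """Cost in cents of re-reading `cached_tokens` tokens that hit the cache."""
    return cached_tokens / 1_000_000 * cents_per_million


# Hypothetical volume: an agent re-reading 10M tokens of the same code.
cached = 10_000_000
deepseek = cache_hit_cost_cents(cached, 2)   # upper bound quoted above
gpt5 = cache_hit_cost_cents(cached, 12)      # price quoted above
print(f"DeepSeek: at most {deepseek:.0f} cents, GPT-5: {gpt5:.0f} cents")
```

Whatever the real volume is, the ratio stays the same: the cached part of the bill is at least six times cheaper at these prices.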

What's pretty amazing (and scary, it's very scary using crush) is that you can just go do something else while it works and puts everything together. Go have dinner while the agent is on the task, or watch a youtube video.

The third and final script will be used to upload the translated files to the wiki. I still need to think about what exactly I want it to do (write API access is not a problem, the problem is just the logic of it all).

As for running the translation job if you're curious, it saves its progress so I can stop and resume any time I want and I estimate around 6-8 days of continuous running to go through everything (there's a lot of material). Yes we could use an API or even rent a GPU and multithread but eh, I figured I only have to do this once. And there's a LOT of tokens to translate, you won't escape that. Even using a cloud API it would probably take a few days of continuous querying.
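The stop-and-resume part can be sketched roughly like this. It is an illustration, not the actual script: it assumes one txt file per page and a small SQLite file as the progress record, and `run_job` and `translate` are hypothetical names (`translate` stands in for whatever call sends text to the local model).

```python
import sqlite3
from pathlib import Path


def open_progress(db_path):
    """Open (or create) the table that records which files are finished."""
    conn = sqlite3.connect(str(db_path))
    conn.execute("CREATE TABLE IF NOT EXISTS done (name TEXT PRIMARY KEY)")
    return conn


def run_job(src_dir, dst_dir, db_path, translate):
    """Translate every txt file in src_dir, skipping files already recorded."""
    src, dst = Path(src_dir), Path(dst_dir)
    dst.mkdir(parents=True, exist_ok=True)
    conn = open_progress(db_path)
    done = {row[0] for row in conn.execute("SELECT name FROM done")}
    for path in sorted(src.glob("*.txt")):
        if path.name in done:
            continue  # finished in an earlier run
        translated = translate(path.read_text(encoding="utf-8"))
        (dst / path.name).write_text(translated, encoding="utf-8")
        # Commit after each file so an interruption loses at most one page.
        conn.execute("INSERT OR IGNORE INTO done (name) VALUES (?)", (path.name,))
        conn.commit()
    conn.close()
```

Because the record is written only after the output file is saved, killing the job mid-page just means that page gets redone on the next run.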

But compare to doing it by hand which, well, we haven't even started despite the instance existing for 4 years. So it's basically 4+ years vs 8 days of work.

Later I can adapt this specific script to work on books to bring more exclusive theory to English like we did for the CIA's Shining Path which was done with what is now an almost obsolete model lol (and I definitely improved the prompting since that one). I might actually redo CIA's Shining Path with mistral just to see how it differs.

The problem, if anything, is that this is making me learn stuff like git to make it FOSS and downloadable, and to make it more robust so it handles more use cases lol

About crush:

Before I started using Crush I didn't really get what an agent actually did or how it helped. This isn't just putting a prompt into the web interface and asking it to generate Python code. The agent makes sure to take care of everything, including writing functions and tests and fixing bugs. That's right, this thing fixes its code automatically.

It calls tools and terminal commands by itself, and can edit files. When it does, you get a git-like preview of the lines edited.

To use Crush you just prompt the LLM: "Okay now I want to do this, now I want to do that, there's a bug, here's the log", and it will work through the problem by itself. It's scary how fast it does it.

You can extend its capabilities with LSPs and MCPs but I haven't looked into that yet. I wish it were more user-friendly to set up, but I got there in the end.

Caveats:

DeepSeek boasts a pretty comfy 128k-token context window, but you run through it quickly because it has to read and understand the entire project. Crush handles this (it makes the LLM write to a crush.md file and then restarts the last command sent when the context resets), but you're still limited. However, with tools like DeepSeek-OCR, if they ever start integrating it, you could potentially have infinite context. Clearly they're going to come up with something, they're already working on it. But you won't be recreating Twitter with an LLM yet.

You don't want a coding-specific fine-tune for this, as it needs to understand the file structure and the readmes. However, I have run into situations where the LLM did stuff it shouldn't have done, for example deleting the database that keeps track of which files we've already worked through, because it doesn't know this is the 'live' prod.

Mind you, I'm pretty much cobbling this together, so I don't git it or anything; it's just a one-time script for our specific needs. I also shouldn't put the content files in the same folder as the script, that's just good practice. I def recommend keeping two copies of your project if you're not going to git. Crush works on one copy and then you can copy the files over to the other folder.

Oh also no chance of crush deleting system32 as it opens in a specific folder and can't leave it. Before running a script it also lets you review the code and asks for permission to run.

This is not replacing devs. It's a great addition to non-devs and devs alike. For non-devs it lets us write our scripts and solve our problems. For devs you spend more time thinking through and planning your app and then send the writing of it to the LLM. As a designer this speaks to me because we plan things a lot lol. And if you know your stuff, you can avoid some of the pitfalls the LLM might go into if you don't specifically prompt it for it.

Also, if you don't know some libraries or APIs very well, it can handle them for you. You can totally give Crush working code you wrote yourself, it's just that it might not be the most efficient way to use it since it could also write that code for you.

Your workflow is basically 3-10x more efficient with this and that's valuable - take a coffee break while it works, you deserve it. You become more of an engineer than a coder and imo this is where dev work is heading.

Translation work:

As for the translation, which is handled by mistral-3.2-instruct (a 24B model that fits on my 16GB and generates at 15 tokens per second, honestly good job France I gotta hand it to you), it's pretty good but you have to prompt it first. The prompt for this task is ~600 tokens, which is a lot but also not a lot considering I can easily have a 16k context window with this tool.
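Long pages have to be cut up to fit that window. A rough sketch of one way to do it is below; it is not the actual script. It assumes about 4 characters per token (a crude rule of thumb, not Mistral's real tokenizer) and that the French output takes about as much room as the English input, so only half of what is left after the prompt goes to input. `split_for_context` is a hypothetical name.

```python
def split_for_context(text, context_tokens=16_000, prompt_tokens=600, chars_per_token=4):
    """Split text on blank lines into chunks that should fit the context window.

    Half of the remaining budget is reserved for the model's output. A single
    paragraph longer than the budget is kept whole rather than cut mid-sentence.
    """
    budget_chars = (context_tokens - prompt_tokens) // 2 * chars_per_token
    chunks, current = [], ""
    for para in text.split("\n\n"):
        candidate = para if not current else current + "\n\n" + para
        if len(candidate) > budget_chars and current:
            chunks.append(current)   # current chunk is full, start a new one
            current = para
        else:
            current = candidate
    if current:
        chunks.append(current)
    return chunks
```

With the defaults this allows roughly 30,000 characters of input per request, so most wiki pages would go through in a single chunk.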

imo a lot of the "we spend more time fixing the translation than we would have spent doing it ourselves" comes from clients incorrectly prompting stuff (but what else is new lol), translators not necessarily using tools to automate bulk edits, and older models not doing as good a job - deepseek is actually pretty solid at translating because of the thinking, though we didn't use a thinking model for this task.

Translating the filenames is messier and more prone to hallucinating random characters. I think it's because it just doesn't have a lot to work on, you're asking it to translate 5 seemingly random words. Translating the page content is much better, some pages that I checked are pretty amazing.
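One cheap guard against garbage filenames is to sanity-check the model's output and fall back to the English name when it looks off. This is only a sketch of the idea, with a hypothetical allow-list of punctuation; it catches stray symbols and invisible characters but not a wrong-but-clean translation.

```python
# Hypothetical allow-list: letters and digits (accented ones included via
# str.isalnum) plus a little punctuation that is normal in page titles.
ALLOWED_PUNCT = set(" -_'(),.")


def looks_clean(name):
    """True if the name is non-empty and has only letters, digits and allowed punctuation."""
    return bool(name.strip()) and all(ch.isalnum() or ch in ALLOWED_PUNCT for ch in name)


def choose_filename(original, translated):
    """Keep the translated name only if it passes the sanity check."""
    return translated if looks_clean(translated) else original
```

For example, `choose_filename("French Revolution", "Révolution française")` keeps the French name, while a name containing `#` or a zero-width space falls back to the English one so it can be fixed by hand later.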

Not all languages work equally well. I used Mistral specifically bc it's French, so we assume it understands French better. Some languages don't have 'enough' presence to be trained on effectively, and others are just not a priority for devs. Chinese LLMs are seemingly better at Persian, for example, but still not 'great'.

Another thing is it sometimes translates jargon two different ways. It would need a dictionary or something like that that says "this word is always translated as X". I'm sure this will come, and in fact a simple dictionary is probably an old-school method for an LLM already. But you would also need to build that dictionary and when you have 5000 pages of content I just don't know where you would even begin.
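Even a tiny hand-built glossary can be used after the fact to flag pages where a term went the wrong way. The sketch below is an illustration only; the glossary entry is an example I picked, not an agreed house style, and the check is a plain substring match.

```python
def glossary_violations(source_text, translated_text, glossary):
    """Return (term, required) pairs where the source uses a glossary term
    but the translation does not contain the required rendering."""
    src, dst = source_text.lower(), translated_text.lower()
    return [
        (term, required)
        for term, required in glossary.items()
        if term.lower() in src and required.lower() not in dst
    ]


# Illustrative entry: "surplus value" shows up in French as both
# "plus-value" and "survaleur"; here we pretend we settled on the former.
glossary = {"surplus value": "plus-value"}
print(glossary_violations("Surplus value is...", "La survaleur est...", glossary))
```

This does not solve the problem of building the glossary for 5000 pages, but it can grow one term at a time as inconsistencies turn up.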

Even with those caveats it gets us 80-90% of the way there, and the remaining work will be to fix stuff manually as we come across it, or with mass regex edits. If we can attract interest in the FR instance with this, as one of our editors has alluded to, then we can also count on crowdsourcing the rest of it over time.
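A mass regex edit over the translated txt files could look something like this. It is a sketch under the assumption that the translations sit in one folder as UTF-8 txt files; `apply_fixes` is a hypothetical name and the pattern in the usage note is only an example.

```python
import re
from pathlib import Path


def apply_fixes(folder, fixes):
    """Apply (pattern, replacement) pairs to every txt file in folder.

    Returns the number of files that actually changed.
    """
    compiled = [(re.compile(pattern), repl) for pattern, repl in fixes]
    changed = 0
    for path in sorted(Path(folder).glob("*.txt")):
        original = path.read_text(encoding="utf-8")
        text = original
        for pattern, repl in compiled:
            text = pattern.sub(repl, text)
        if text != original:
            path.write_text(text, encoding="utf-8")
            changed += 1
    return changed
```

For instance, `apply_fixes("pages_fr", [(r"\bsurvaleur\b", "plus-value")])` would normalise one term everywhere. Since it overwrites files in place, run it on a copy of the folder first.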

Conclusion:

We're doing pretty exciting things for 67 cents.

7th Win Babyyyyy! (external-content.duckduckgo.com)
submitted 3 months ago* (last edited 3 months ago) by pineapplelover@lemmy.dbzer0.com to c/hockey@lemmy.ca
 
 

We taking that frickin cup


Unbelievable

https://www.occrp.org/en/news/eu-continues-exporting-banned-pesticides-to-global-south-despite-pledge-to-end-trade

#EU #EC #Europe #EuropeanCommission #policy #politics #LMIC #LIC


Congressional Republicans are reportedly trying to insert anti-abortion language into government funding legislation as the shutdown continues, with the GOP and President Donald Trump digging in against a clean extension of Affordable Care Act tax credits as insurance premiums surge.

Sen. Ron Wyden (D-Ore.), the top Democrat on the Senate Finance Committee, sounded the alarm on Saturday about what he characterized as the latest Republican sneak attack on reproductive rights.

Wyden said Saturday—which marked day 39 of the shutdown—that “Republicans are spinning a tale that the government is funding abortion.”

“It’s not,” Wyden continued. “What Republicans are talking about putting on the table amounts to nothing short of a backdoor national abortion ban. Under this plan, Republicans could weaponize federal funding for any organization that does anything related to women’s reproductive healthcare. They could also weaponize the tax code by revoking non-profit status for these organizations.”

“The possibilities are endless, but the results are the same: a complete and total restriction on abortion, courtesy of Republicans,” the senator added. “Trump said he’d leave abortion care up to the states. Well, this latest scheme makes it crystal clear: A de facto nationwide abortion ban has been his plan all along.”


Last thread here

Welcome to this week’s casual kōrero thread!

This post will be pinned in this community so you can always find it, and will stay for about a week until replaced by the next one.

It’s for talking about anything that might not justify a full post. For example:

  • Something interesting that happened to you
  • Something humorous that happened to you
  • Something frustrating that happened to you
  • A quick question
  • A request for recommendations
  • Pictures of your pet
  • A picture of a cloud that kind of looks like an elephant
  • Anything else, there are no rules (except the rule)

So how’s it going?


Direct link to the Nintendo compatibility site: https://switch-software-compatibility.nintendo.com/en-US


cross-posted from: https://lemmy.ml/post/38768545


TIM KAINE IN THE BRAIN


Beijing officials call China’s current deflationary malaise “involution” — a destructive cycle of intense, self-destroying business competition sparked by excess capacity. Yang Zhifeng calls it something else: “twisted.”

Deflation signals a lopsided economy where supply dwarfs demand. That hurts companies, which in turn hurts workers. As consumption weakens, businesses spend less, economic activity slows, debt burdens rise, which then causes more deflation. The downward loop, known in economics as a deflationary spiral, feeds on itself once entrenched.

The trend also carries global implications: cheap Chinese exports can depress prices abroad, strain relations with trading partners, and create knock-on effects for multinational companies. Global institutions are sounding the alarm, with the International Monetary Fund projecting that consumer inflation in China will average zero this year — the second-lowest of nearly 200 economies it tracks. The Bank of Korea warned in July that China could export deflation to its trading partners.

And the problem could be even worse than they realize. China’s official CPI figure — which offers limited item-level detail and is shaped by a complex methodology that isn’t transparent — has hovered around zero since early 2023, occasionally posting modest gains. Bloomberg News analyzed prices for dozens of products in 36 major cities as well as both official and private data across China to get a sense of how much cheaper things have become on the ground. We looked at items in categories like food, groceries, consumer goods and services, as well as housing costs and price changes for specific car brands.

The analysis showed that prices are unmistakably dropping. Among 67 items tracked by Bloomberg News, prices on 51 dropped over the last two years. Economists say that official inflation measures may only partially capture the reality. Many key data series have quietly disappeared in recent years, and the National Bureau of Statistics has never offered the sort of granularity more common in the US, where inflation trackers go so far as to publish the cost of indoor plants and pet food. An outdated methodology for calculating rent changes in the CPI likely led to its overestimation in the past few years.

The NBS didn’t reply to a faxed request for comment.


Former IDF military advocate general Maj. Gen. Yifat Tomer-Yerushalmi, who was released to house arrest last week after being detained over the Sde Teiman video leak, was hospitalized on Sunday morning after medics were called to her home over what was later confirmed to have been a suicide attempt.

On Sunday night, Police Commissioner Danny Levy confirmed that Tomer-Yerushalmi had been hospitalized following an attempt to end her own life. “Her life isn’t rosy,” he said of the disgraced prosecutor, after a protester was detained outside of her home.

The police chief also addressed the allegations against Tomer-Yerushalmi, saying that “if she committed the offense, this affects how the army looks, how soldiers behave. We send our kids to an organization where they should be sure that nobody is leaking things, and that’s why we’re probing it,” he added.


The Senate is voting on the first steps to end the 40-day government shutdown Sunday after a group of moderate Democrats agreed to proceed without a guaranteed extension of health care subsidies, angering many in their caucus who wanted to continue the fight.

The group of three former governors (New Hampshire Sen. Jeanne Shaheen, New Hampshire Sen. Maggie Hassan and Independent Sen. Angus King of Maine) said they would vote to reopen the government if the Senate passed three annual spending bills and extended the rest of government funding until late January. Senate Majority Leader John Thune endorsed the deal Sunday night and called an immediate vote to begin the process of approving it.

The deal would also include a future vote on the health care subsidies, which would not have a guaranteed outcome, and a reversal of the mass firings of federal workers that have happened since the shutdown began on Oct. 1. The full text of the deal has not yet been released.


Mashhad, home to around 4 million people and Iran’s holiest city, relies on four dams for its water supply. Esmaeilian said consumption in the city had reached about “8,000 litres per second, of which about 1,000 to 1,500 litres per second is supplied from the dams”.

Authorities in Tehran warned over the weekend of possible rolling cuts to water supplies in the capital amid what officials call the worst drought in decades. The Iranian president, Masoud Pezeshkian, has cautioned that without rainfall before winter, even Tehran could face evacuation.

In the capital, five major dams supplying drinking water are at “critical” levels, with one empty and another at less than 8% of capacity, officials say.
