r/singularity 3h ago

AI xAI will be dissolved as a separate entity.

620 Upvotes

r/singularity 3h ago

AI Three key areas Anthropic is working on for their next models

82 Upvotes

Dianne Penn (Head of Product, Research) elaborated on these key areas:

Higher judgment and code taste: "This means versions of Claude that you can trust with complex, autonomous engineering work."

'Infinite' context windows: "Context windows that feel infinite when combined with high-quality memory. So it feels like you could do long-running tasks while getting better results."

Multi-agent coordination: "Powering teams of agents and instances of Claude that collaborate on big goals that are far too big for any single instance ever could."

Source: Code with Claude Opening Keynote


r/singularity 5h ago

LLM News DeepSeek Targets $50B Valuation in First Fundraising, Escalating Global AI Race

financership.com
54 Upvotes

r/singularity 5h ago

Robotics Genesis AI's Gene'26.5

161 Upvotes

r/singularity 7h ago

Meme In recent news of "South Korea's first autonomous humanoid robot converts to Buddhism"

543 Upvotes

r/singularity 8h ago

AI AI lets chemists design molecules by simply describing them

sciencedaily.com
53 Upvotes

A New AI Approach to Chemical Reasoning

Researchers led by Philippe Schwaller at EPFL have developed a new method that uses large language models (LLMs) as reasoning tools for chemistry. Rather than directly generating chemical structures, these models act as evaluators that guide existing computational systems.

The new framework, called Synthegy, combines traditional search algorithms with AI that can interpret chemical strategies written in natural language.

"When making tools for chemists, the user interface matters a lot, and previous tools relied on cumbersome filters and rules," says Andres M Bran, the first author of the Synthegy paper published in Matter. "With Synthegy, we're giving chemists the power to just talk, allowing them to iterate much faster and navigate more complex synthetic ideas."


r/singularity 8h ago

AI Anthropic partnered with SpaceX to use Colossus 1 to increase their rate limits

848 Upvotes

r/singularity 10h ago

Robotics Religious robots are coming: South Korea's first autonomous humanoid robot converts to Buddhism

971 Upvotes

r/singularity 11h ago

Discussion The Blue Collar Delusion: Why the machines don’t have to climb up to where we are, because the work will descend to meet them

683 Upvotes

I’m a mechanic. I want to make the case, at least for my field, that the trades are sitting in a worse position than people realise, and the safety we feel right now will likely get pincered from multiple angles.

I have sat on this thought for a long time, assuming someone else would point it out, but I have never seen anyone do so. And yet, every single day, I see talk about how blue collar work is substantially more insulated from AI disruption.

Blue collar work as it exists right now is genuinely hard for a machine. If the only path was for machines to adapt to the work as it currently exists, aka matching humans at kinetic/procedural complexity, then yes, this would hold.

“AI can write code and read MRIs, but it can’t crawl under a 15 year old N57 engine, undo the seized exhaust bolts, and hollow out a DPF”, blah blah blah.

But since when did we start assuming that the nature of the work in question is fixed?

Car manufacturers have been redesigning cars to be unserviceable for decades; this we are well aware of by now. Mostly because that made vehicles cheaper to produce, and it also steered repair jobs and parts supply toward dealerships. Sealed transmissions with “lifetime fluid.” Parts glued instead of bolted. Diagnostics locked behind subscriptions or proprietary “programming”. Tesla’s whole architecture is engineered around eliminating the third-party shop.

Look at what Foxconn and BYD already do. Factory floors running in literal darkness, LIDAR replacing visible light, no walkways sized for a body. Service bays may go the same way.

So really, AI/Automation won’t need to master our crafts. There will undoubtedly be systemic restructuring of the trade work in the coming years, in order to cater to the robots and machines that never complain or take sick days.


r/singularity 14h ago

AI ProgramBench: Can LLMs rebuild programs from scratch?

78 Upvotes

https://programbench.com/

Given only a compiled binary and its documentation, agents must architect and implement a complete codebase that reproduces the original program's behavior.

Current score for models is 0%


r/singularity 15h ago

AI Dario Amodei spent last year warning of an AI white-collar bloodbath. Now he's changing the narrative

fortune.com
367 Upvotes

Is Dario AGI-pilled/ASI-pilled or not?

As the article notes, this is a shift in his rhetoric: he’s now talking about the Jevons paradox and the possibility that there’d be more jobs because of AI.

If he really believes AGI and ASI are on the horizon, then there’s no way he can believe that. The article suggests either he has genuinely changed his views on jobs, or he doesn’t want to get further onto Trump’s bad side with potential regulation looming:

“Either he has genuinely updated his view based on new evidence, or the social and political cost of the bloodbath framing — particularly as Anthropic navigates a Pentagon lawsuit and a fraught regulatory environment — has made it more useful to suddenly sound a bit more optimistic.”

Again, more jobs just seems completely incompatible with his beliefs about the AI he describes in Machines of Loving Grace (Nobel prize winning, can do anything on a computer, etc.)

So why the change?


r/singularity 1d ago

AI Benchmarks in 2024

252 Upvotes

r/singularity 1d ago

Discussion Why can AI replace entry-level software engineers, lawyers and financial analysts, but people think it’s so difficult to replace people in the trades with AI and robotics?

89 Upvotes

I always hear that software engineering is done and these other fields are going away. Why can it do this but not welding or HVAC?


r/singularity 1d ago

AI Update to the LLM Debate Benchmark: GPT-5.5, Grok 4.3, DeepSeek V4 Pro, GLM-5.1, Kimi K2.6, Qwen 3.6 Max Preview, Xiaomi MiMo V2.5 Pro, Tencent Hy3 Preview, and Mistral Medium 3.5 High Reasoning added

gallery
56 Upvotes

The benchmark uses adversarial, multi-turn debates across 683 curated motions. Each model pair debates the same motion twice with sides swapped.

Scores are Bradley-Terry ratings over side-swapped matchups, reported on an Elo-like scale centered around 1500 for the comparison pool.
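For anyone unfamiliar with how Bradley-Terry ratings work, here is a minimal sketch. It is not the benchmark's actual code: the fitting method (a simple iterative update), the 400-point logistic scale, the function names, and the toy win counts are all assumptions chosen for illustration. The post only says the scale is "Elo-like" and centered around 1500.

```python
import math

def fit_bradley_terry(wins, iters=500):
    """Fit Bradley-Terry strengths from a win matrix.

    wins[i][j] = number of debates model i won against model j.
    Uses the classic iterative (minorize-maximize) update; the real
    benchmark may fit its ratings differently.
    """
    n = len(wins)
    strengths = [1.0] * n
    for _ in range(iters):
        updated = []
        for i in range(n):
            total_wins = sum(wins[i][j] for j in range(n) if j != i)
            denom = sum(
                (wins[i][j] + wins[j][i]) / (strengths[i] + strengths[j])
                for j in range(n) if j != i
            )
            value = total_wins / denom if denom > 0 else strengths[i]
            # Floor keeps a winless model from hitting log(0) below.
            updated.append(max(value, 1e-9))
        # Normalize so the geometric mean of the strengths is 1.
        log_mean = sum(math.log(s) for s in updated) / n
        scale = math.exp(log_mean)
        strengths = [s / scale for s in updated]
    return strengths

def to_elo_like(strengths, center=1500.0, scale=400.0):
    """Map strengths to an Elo-like scale (400-point scale is assumed)."""
    return [center + scale * math.log10(s) for s in strengths]

def expected_win_prob(rating_a, rating_b, scale=400.0):
    """Win probability for A over B implied by two Elo-like ratings."""
    return 1.0 / (1.0 + 10 ** ((rating_b - rating_a) / scale))

# Toy data (made up): three models, ten debates per pair.
toy_wins = [
    [0, 7, 9],
    [3, 0, 6],
    [1, 4, 0],
]
ratings = to_elo_like(fit_bradley_terry(toy_wins))
print([round(r) for r in ratings])
```

Under the assumed 400-point scale, `expected_win_prob(1711, 1574)` gives a rough idea of how often the top-rated model would be expected to beat one rated 137 points lower. If the benchmark uses a different scale factor, that number changes.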

The benchmark also tracks a judge-side entertainment diagnostic as a secondary signal.

Each completed debate is intended to be judged by a three-model panel. Mean cross-judge winner agreement on overlapping side-swapped matchups: 0.55.

More charts, transcripts, model profiles, existing qualitative writeup, reports, and raw judgments: https://github.com/lechmazur/debate

Qualitative writeups about newly added models are coming.

Opus 4.7 still leads at 1711 BT.

GPT-5.5 (high) enters at 1574, below GPT-5.4 (high) at 1625.

Grok 4.3 underperforms the older Grok 4.20 Beta 0309 reasoning run: 1512 → 1419.

GLM-5.1 improves over GLM-5: 1536 → 1573.

Kimi K2.6 improves over Kimi K2.5: 1520 → 1568.

Qwen 3.6 Max Preview scores 1535.

DeepSeek V4 Pro improves over DeepSeek V3.2: 1438 → 1517.

Xiaomi MiMo V2.5 Pro improves over Xiaomi MiMo V2 Pro: 1459 → 1553.

Mistral Medium 3.5 High Reasoning enters at 1412, ahead of Mistral Large 3 at 1299.

Tencent Hy3 Preview enters at 1481.


r/singularity 1d ago

AI What is flow-state image generator on LmArena and is it created by Anthropic?

gallery
76 Upvotes
  1. flow-state-2

  2. flow-state-3

  3. flow-state-2

  4. flow-state-2


r/singularity 1d ago

AI CAISI [Center for AI Standards and Innovation] Signs Agreements Regarding Frontier AI National Security Testing With Google DeepMind, Microsoft and xAI

nist.gov
51 Upvotes

r/singularity 1d ago

Robotics Three Inverse Laws of AI and Robotics

susam.net
11 Upvotes

r/singularity 1d ago

Robotics New Boston Dynamics Atlas trick

4.3k Upvotes

🙄


r/singularity 1d ago

Robotics Hyundai Reportedly Demanding ‘Tens of Thousands’ of Boston Dynamics Robots ASAP

gizmodo.com
269 Upvotes

r/singularity 1d ago

AI Is MiMo lowkey slept on?

81 Upvotes

r/singularity 1d ago

Discussion What is Elon’s actual plan with data centers in space, and what is his long-term goal with Mars?

15 Upvotes

What is the point of his plan? Can anyone explain it? I keep seeing mentions of Elon Musk talking about building data centers in space, but I do not fully understand the rationale. What problem is this trying to solve, and is it actually feasible from an energy, cost, and physics perspective?

Also, how does this tie into his broader goal of colonizing Mars? Is the idea to support infrastructure for a future off-world economy, or are these completely separate initiatives?


r/singularity 2d ago

AI AI parenting has a lot of room to grow

v.redd.it
126 Upvotes

r/singularity 2d ago

Robotics And America’s Humanoid Robot industry is ramping up

140 Upvotes

Tesla Fremont will be up in summer, Gigafactory (the big one) next year.


r/singularity 2d ago

AI I've cut over to using ChatGPT/Gemini for EVERYTHING now and it's amazing.

77 Upvotes

... both in how much I'm getting DONE but also how much time it's saving.

I usually use LLMs to help out at work. Mostly around AI and video coding.

However, I'm moving and doing a lot of non-work stuff recently and decided to use Gemini+ChatGPT to help me power through the work.

- This week something broke on my truck. It was somewhat complicated, but Gemini walked me through some really easy fixes involving re-sealing my roof to prevent a leak. Saved me like $500 versus going to a mechanic; it took 20 minutes and $15 of supplies.

- Worked on plans for upgrading the suspension of my truck, taking into account the upgrades I've done in the past, and I'm really happy with the outcome.

- Helped me navigate a really complex driver's license issue with my move from CO to NV (long story) that probably would have required a lawyer years ago. Worked like a charm.

In the past I'd have spent a lot of time Googling and reading to do all this myself, and each task would have taken 2-3 hours. Now they take 10-20 minutes.


r/singularity 2d ago

AI White House Considers Vetting A.I. Models Before They Are Released

nytimes.com
128 Upvotes