Hacker News | nabakin's comments

> Also, note that there's zero CUDA dependency. It runs entirely on Huawei chips.

That is a huge claim to make with no evidence.

I researched what you said, and I have found no statement to that effect in their paper[0], on huggingface[1], twitter[2], WeChat[3], or in their news release[4].

Only in a footnote, and only in the Chinese version of their news release, do they mention that they plan to reduce inference costs with the Ascend 950 supernode when it releases[5]. The only mention of Huawei in their paper is that they validated a technique to lower interconnect bandwidth on Ascend NPUs and Nvidia GPUs[6].

[0] https://huggingface.co/deepseek-ai/DeepSeek-V4-Pro/blob/main...

[1] https://huggingface.co/deepseek-ai/DeepSeek-V4-Pro

[2] https://xcancel.com/deepseek_ai/status/2047516922263285776

[3] https://mp.weixin.qq.com/s/8bxXqS2R8Fx5-1TLDBiEDg

[4] https://api-docs.deepseek.com/news/news260424

[5] https://api-docs.deepseek.com/zh-cn/img/v4-price.png

[6] Page 16


Comments like this are why I go to the comments! I never would have thought to check.

And while I'm here, I want to note that I feel there's a big misunderstanding of what is and isn't demonstrated by DeepSeek. So far as I can tell, the major (and important!) innovation is reproducing near-frontier capabilities at a fraction of the cost. But it may be that iterating forward at the frontier is the costly part, a cost borne by Western companies, and that nuance seems to get lost with DeepSeek. Which is not to say that, as a matter of principle, non-Western companies aren't sometimes capable of jumping into the lead (Kimi has been super impressive), but if GPT/Claude/etc. "only" lead at the frontier with more expensive models, that's still a moat.


If you can get something almost as capable for a fiftieth of the price, in most cases you'll do that. You might still send a few tokens to the more expensive option for the exceptional, difficult cases, but that's maybe 10% of tokens at most. I don't see how it'll be possible to keep spending what Anthropic, OpenAI, Google, etc. are spending if they're only going to see the trickiest 10% of tokens.

Missed the point award

Maybe I need to spell out the step that connects them: how will those companies afford to keep "iterating forward at the frontier" when their income is probably headed for a huge crash from competition with good-enough, open models at 1/50th the price?

Iterating forward at the frontier doesn't seem like a sustainable approach if everyone else can catch up with you in 6 months.


Thank you for this due diligence. I was just reading through the technical report and couldn't find any references to the software stack or hardware mentioning Huawei either, and I came back here wondering about this comment that I had read earlier.

Not long ago the story was this:

DeepSeek’s next AI model delayed by attempt to use Chinese chips

https://www.ft.com/content/eb984646-6320-4bfe-a78d-a1da2274b...


Here's a note about running entirely on Huawei chips:

https://finance.yahoo.com/sectors/technology/articles/deepse...


> DeepSeek indicated that current service capacity for the V4 Pro series is constrained by a computing crunch, though pricing could fall after new clusters powered by Huawei's Ascend 950 chips come online in the second half of the year.

Only mention of Huawei in that article (as of now).


Did you read any part of the link you posted? Huawei is mentioned once and not in the context of the model being trained or currently running on Huawei chips.

Dammit, you found my technique of “citing” sources for papers in high school...

At least when I pulled random citations off Wikipedia I could reasonably trust whoever put it there figured it was tangentially related to what was being cited. I’m not sure I could get away with putting a literal press release that I didn’t read anywhere.

Big L for media literacy there.


They mention it uses MXFP4 quantization, which is a Blackwell capability, but it looks like this is also supported by the Ascend 950 series, according to marketing material.

DeepSeek is planning to use Huawei extensively for inference:

“Due to constraints in high-end compute capacity, the current service capacity for Pro is very limited. After the 950 supernodes are launched at scale in the second half of this year, the price of Pro is expected to be reduced significantly.”

https://x.com/jukan05/status/2047516566149816627


Yes, that's the footnote from citation [5].

I said the same thing as you and I got summarily downvoted (https://news.ycombinator.com/item?id=47888227).

That HN is quick to upvote an unsubstantiated comment (the grandparent one, perhaps because it aligns with an anti-US bias?) and downvote a fact-finding one doesn't bode well for the community as a whole. I have seen how political ideology colors everything in my home country (Malaysia), and the decline of the country is palpable; I didn't expect to find such a thing here. We are supposed to be dispassionate and rational, right?

Render to Jesus what's due to him, ditto for Caesar.


Probably because you said you used DeepSeek. People don't want to see AI in the comments and don't trust AI responses.

> When it comes to information transfer and processing, light can do things that electricity can’t. Photons — particles of light — are far zippier than electrons at working their way through circuits.

Electrons themselves don't move at the speed of light, but information transfer (i.e. communication) via electrons does happen close to the speed of light.

A subtle but important distinction that's often misunderstood, and it means computational performance gains would probably come from bandwidth, not latency.
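To put rough numbers on that latency point (the ~0.5c velocity factor and the 30 cm distance below are illustrative assumptions, not measurements):

```python
# Back-of-the-envelope: one-way propagation delay at a fraction of c.
C = 299_792_458  # speed of light in vacuum, m/s

def propagation_delay_ns(distance_m: float, velocity_factor: float) -> float:
    """One-way delay in nanoseconds over distance_m at velocity_factor * c."""
    return distance_m / (C * velocity_factor) * 1e9

# A 30 cm electrical path at ~0.5c vs. light in vacuum over the same distance:
print(f"electrical: {propagation_delay_ns(0.30, 0.5):.1f} ns")    # ~2.0 ns
print(f"vacuum light: {propagation_delay_ns(0.30, 1.0):.1f} ns")  # ~1.0 ns
```

Even at half the speed of light, the absolute delay over board-scale distances is a nanosecond or two, which is why the win from optics tends to show up as bandwidth rather than latency.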


About 0.65c for Cat6 cables; different types of cable can be slightly faster. The speed of light in standard fiber is about 0.68c, due to the refractive index (~1.47) of the core.
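For fiber, the slowdown follows directly from v = c/n; the refractive index below (~1.47, a typical value for a silica core, not a spec) is an assumption for illustration:

```python
# Sketch: light in a medium travels at v = c / n.
def speed_fraction(refractive_index: float) -> float:
    """Fraction of c at which light propagates in a medium of index n."""
    return 1 / refractive_index

print(f"silica fiber core (n ~ 1.47): {speed_fraction(1.47):.2f}c")  # ~0.68c
```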

Is that through solid-core fiber? Because hollow-core fiber also exists.

Yes. Hollow-core fiber does exist, but it's not widely used; it has only just made it out of the lab.

> but information transfer (i.e. communication) via electrons does happen close to the speed of light

Speed of light in the medium, not speed of light in vacuum.


In electric circuits, information is transmitted through the electric field, which propagates at close to the speed of light.

Nope, it's 1/2 - 2/3 the speed of light depending on the metals used.

The velocity factor is usually 0.6-0.7; I've never seen it as low as 0.5.

And it's set by the dielectric, not the conducting material.
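The dielectric point can be made concrete: for a line whose fields travel through a dielectric of relative permittivity eps_r, the velocity factor is 1/sqrt(eps_r). The permittivity values below are typical figures for common insulators, assumed for illustration:

```python
import math

def velocity_factor(eps_r: float) -> float:
    """Velocity factor of a transmission line, set by the dielectric: 1/sqrt(eps_r)."""
    return 1 / math.sqrt(eps_r)

# Solid PTFE (eps_r ~ 2.1) and solid polyethylene (eps_r ~ 2.25):
print(f"PTFE: {velocity_factor(2.1):.2f}c")           # ~0.69c
print(f"polyethylene: {velocity_factor(2.25):.2f}c")  # ~0.67c
```

Foamed dielectrics have a lower eps_r, which is how some cables reach velocity factors above 0.8.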


You're both wrong. It's true that the first whisper of movement travels at near the speed of light, but the flow takes longer to stabilize (which you WILL need to wait for in electrical chips) than that headline "speed of electricity" suggests.

Oh, and also: currently the idea behind on-chip lasers is interconnects that don't have this limitation. For example, PCIe is looking to build optical interconnects, which would do the equivalent of bringing every GPU 10x closer to the memory.

Optical computation would require light to switch optical transistors on and off, which doesn't seem to be possible with this technology. This is optical computation only in the sense of producing light beams according to formulas.


Why do you need to wait for it to stabilize? You can keep changing the voltage at one end of the connection even with megabits of data currently in transit, without waiting for it to stabilize. Yes, you'll need to do impedance matching. Yes, that's a solved problem: transmission lines.
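The "megabits in transit" point is just the bandwidth-delay product; the link parameters below are illustrative assumptions:

```python
# Sketch: bits "in flight" on a link = bit rate x one-way propagation delay.
C = 299_792_458  # speed of light in vacuum, m/s

def bits_in_flight(bit_rate_bps: float, distance_m: float,
                   velocity_factor: float) -> float:
    """Bandwidth-delay product: bits on the wire at any instant."""
    return bit_rate_bps * distance_m / (C * velocity_factor)

# 10 Gb/s over 100 m of copper at ~0.65c: thousands of bits in transit at once.
print(round(bits_in_flight(10e9, 100, 0.65)), "bits in transit")
```

None of those bits require the line to "stabilize" before the next one is launched; the transmitter keeps driving new symbols behind them.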

Looking at the discussion below this comment, I'd just add this video by AlphaPhoenix:

https://www.youtube.com/watch?v=2Vrhk5OjBP8

Good discussion in the comments there as well.


Faster than TensorRT-LLM on Blackwell? Or do you not consider TensorRT-LLM open source because some dependencies are closed source?


I reviewed the TensorRT-LLM commit history from the past few days and couldn't find any updates regarding Gemma 4 support. By contrast, here is the reference for MAX: https://github.com/modular/modular/commit/57728b23befed8f3b4...


If OP meant they have the fastest implementation of Gemma 4 on Blackwell at the moment, I guess that is technically true. I doubt that will hold up when TensorRT-LLM finishes their implementation though.


How is the sglang performance on Blackwell for this model?


Dunno but there's a PR for it. Probably also more performant than Modular.


It's referring to the LMSYS Leaderboard / LMArena / Arena.ai[0]. It's very well known in the LLM community for being one of the few sources of human evaluation data.

[0] https://arena.ai/leaderboard/chat


Public benchmarks can be trivially faked. Lmarena is a bit harder to fake and is human-evaluated.

I agree it's misleading for them to hyper-focus on one metric, but public benchmarks are far from the only thing that matters. I place more weight on Lmarena scores and private benchmarks.


Concentrating on LMArena cost Meta many hundreds of billions of dollars, and lots of people their jobs, with the Llama 4 disaster.


LMArena is so easy to game that it ceased to be a relevant metric over a year ago. People are not reliable validators beyond "yeah, that looks good to me"; nobody checks whether the facts are correct.


Alibaba maintains its own separate version of LMArena, where the prompts are fixed and you simply judge the outputs:

https://aiarena.alibaba-inc.com/corpora/arena/leaderboard


I agree; LMArena died for me with the Llama 4 debacle. And not only the gamed scores, but seeing with shock and horror the answers people found good. It does test something, though: the general "vibe" and how human, friendly, and knowledgeable a model _seems_ to be.


It's easy to game and human evaluation data has its trade-offs, but it's way easier to fake public benchmark results. I wish we had a source of high quality private benchmark results across a vast number of models like Lmarena. Having high quality human evaluation data would be a plus too.


Well there was this one [0] which is a black box but hasn't really been kept up to date with newer releases. Arguably we'd need lots of these since each one could be biased towards some use case or sell its test set to someone with more VC money than sense.

[0] https://oobabooga.github.io/benchmark.html


I know ARC-AGI-2 has a private test set and they have a good number of results[0], but it's not a conventional benchmark.

Looking around, SWE-Rebench seems to have decent protection against training-data leaks[1]. Kagi has one that is fully private[2]. There's one on Hugging Face that claims to be fully private[3]. SimpleBench[4]. HLE apparently has a private test set[5]. LiveBench[6]. Scale has some private benchmarks, though not many models tested[7]. vals.ai[8]. FrontierMath[9]. Terminal Bench Pro[10]. AA-Omniscience[11].

So I guess we do have some decent private benchmarks out there.

[0] https://arcprize.org/leaderboard

[1] https://swe-rebench.com/about

[2] https://help.kagi.com/kagi/ai/llm-benchmark.html

[3] https://huggingface.co/spaces/DontPlanToEnd/UGI-Leaderboard

[4] https://simple-bench.com/

[5] https://agi.safe.ai/

[6] https://livebench.ai/

[7] https://labs.scale.com/leaderboard

[8] https://www.vals.ai/about

[9] https://epoch.ai/frontiermath/

[10] https://github.com/alibaba/terminal-bench-pro

[11] https://artificialanalysis.ai/articles/aa-omniscience-knowle...


And nowadays we have Debian running in a VM on Android[1].

[1] https://www.zdnet.com/article/how-to-use-the-new-linux-termi...


I would consider it reasonable if this were 4x for both TTFT and throughput, but it seems it's only for TTFT.


And right to repair


TIL Europe still has some presence in the Americas. Thought all of that was gone with the Monroe Doctrine


The Monroe Doctrine was about preventing colonial powers from enacting NEW efforts to reach into the Americas, not about getting rid of previous control.

"The occasion has been judged proper for asserting, as a principle in which the rights and interests of the United States are involved, that the American continents, by the free and independent condition which they have assumed and maintain, are henceforth not to be considered as subjects FOR FUTURE COLONIZATION by any European powers." (emphasis mine)

https://usinfo.org/PUBS/LivingDoc_e/monroe.htm


France's longest land border is the one it shares with Brazil.



Yeah, you can visit the EU by… sailing a ways Northeast(ish) from Maine, until you’re just south of (a part of) Canada. And by going to the Caribbean. And South America.

Mostly France and the Netherlands.


Ty this is great


I understand they are similar, but I think this post adds new information to the situation. Regardless, appreciate your help moderating the site.

