TSMC and Graphcore Prepare for AI Acceleration on 3nm (anandtech.com)
144 points by rbanffy on Aug 27, 2020 | 133 comments


This must be scary reading for Intel, still battling to make anything decent at 10nm (which I know is more like 7nm TSMC, but still, this is another 2 steps on from their 7).

It's a bit worrying how all this innovation, with the possibility (perhaps even likelihood) of completely cornering the market, is concentrated in one country that is very close to another country that doesn't want it to be its own country.


> all in one country, that is very close to another country, that doesn't want it to be its own country.

That's actually good news for Taiwan. TSMC is already very important for everyone. If it becomes the only player in high-end chip making, there is a good chance western countries won't let China invade Taiwan.


Why would China invade Taiwan when it can just buy all of its engineers?


Because China thinks that territory belongs to them, their motivation has nothing to do with TSMC.


TSMC will build a factory in the USA: https://time.com/5837274/tsm-chip-plant-arizona/


which is only 5nm, only entering production in 2023, and only 20,000 wafers / month vs their gigafabs which are orders of magnitude larger


It's a good start. They need to move carefully while balancing their assets.

They also need to train the new employees in the US; it'll take time to get great chip engineers there and to get Taiwanese volunteers to move to the US.


Isn't IC fabrication mostly automated? So wouldn't most jobs be in research?


It is automated in the sense that silicon goes in one end, and ideally no human being touches it until it comes out of the other end in a finished package. It is not at all automated in the sense that there is a ton of maintenance work on the production line.

I love reading through random tech journals, the type with highly industry-specific ads in them, as an insight into the worlds other people live in. Jobs and everyday problems I will never experience first-hand.

One of the single most eye-popping experiences like that was an ad in a chip industry quarterly I found somewhere that was breathlessly advertising that their latest tool (a machine the size of a garage) now had an MTBF of "just" 27 hours instead of the previous 2-3 hours, a massive tenfold improvement! Cutting edge stuff, apparently, compared to the competition. The really impressive part apparently was that by cutting down on maintenance time, it could be "online" for up to 90% of the time instead of just 30-50% of the time or somesuch. Apparently that's impressively good in this space.

The rest of the ad (and much of the journal) kept going on and on (and on!) about how their tool has easy-swap parts, can keep the vacuum during repair, has tool-less panels, quick access, no need for heavy lifting, etc...

I don't think I'm exaggerating too much by saying that nearly 50% of all technical development in the fab industry is about reducing the maintenance effort, and hence staffing costs.
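
To see why those MTBF and uptime figures hang together, here is a back-of-the-envelope availability calculation in Python; the ~3 hour repair time is my own assumption, not a number from the ad.

    # Availability = MTBF / (MTBF + MTTR); the 3 h mean time to repair is an assumed value.
    def availability(mtbf_hours, mttr_hours=3.0):
        return mtbf_hours / (mtbf_hours + mttr_hours)

    print(availability(2.5))   # ~0.45 -> roughly the old "30-50%" uptime
    print(availability(27.0))  # 0.90  -> the advertised "up to 90%" uptime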


I can only wonder why one would use ads in an industry which is so specialized and has so few players. Wouldn't it be more efficient to just talk to these people?


I think they want to do both.

Put ads and positive press in trade journals you know your customers read. It increases the percentage of their minds you occupy.

In-person contact does that too. They aren't mutually exclusive strategies.

A lot of this equipment is so expensive that the cost of a few trade journal ads is negligible compared to the amount of revenue involved.


Why would they do that?


The moving part? Probably because of better salaries. I don't know much about Taiwan and how nationalistic they are, but for most of the world, working in the US is seen as the career endgame (or a means to get there).

If the question was why Intel would invest in making chips in the US - from recent news it seems the US understands how dependent it is on other nations for silicon manufacturing, which raises many questions. The biggest one seems to be national security, which, looking in from the outside, seems to be the only factor that gets things done in the US.


> I don't know much about Taiwan and how nationalistic they are

You will want to read up on The Republic of Formosa, WWII and the history of Taiwan at some point. There are enough Chinese nationals on this website to downvote us all but suffice it to say, it is widely regarded at the federal level that Taiwan could come under siege by the Chinese Communist Party at any time and be used as a bargaining chip with the West along with Hong Kong, the DPRK armistice and Western financial interests in Macau. We need to begin evacuating Hong Kong and Taiwan immediately back to the USA -- in fact, China just seized a boat of people fleeing Hong Kong last night, the first time they have done that (and on what charges, nobody knows, not even the Chief of Police of Hong Kong). As the President said today, we are going to break it off cold with China and as you expand your knowledge of the situation, you will realize that China claims Taiwan as part of its territory and the people of Taiwan do not.


Hong Kong was legally handed over to China, but they had a 50(?) year handover period which they're breaking. Taiwan was never China's. A bunch of Chinese nationals went to China during the cultural revolution, and China would love to have Taiwan, but Taiwanese people are basically Chinese refugees and do not want that. China has been posturing to conquer them for years. My wife's parents (Taiwanese) immigrated to Canada during one of the periods of high tension between the countries. China was flying fighter jets over Taipei as an intimidation tactic.


> which is only 5nm, only entering production in 2023, and only 20,000 wafers / month vs their gigafabs which are orders of magnitude larger

And there is a good chance that it may never be built, just like Foxconn's Wisconsin facility.


The US invaded for oil.

What will it do for chip production?


Lose its competitive advantage, slowly consolidate remaining assets into private hands with borderless migration capacity, elect a government composed entirely of con men, and five to ten years from now, begin losing every violent international confrontation that it tries to start?


I agree with everything but...the losing of every violent international confrontation.

Maybe you'd lose against other modern, well-integrated forces. But there is a lot of the world where forts, trenches, and gun emplacements are still their best, and they are simply butter to the modern military knife.

Barring major catastrophe (WW3, another American civil war, a pandemic that kills a large % of the younger population, etc.), it will take far more than 10 years to decline past that point.

The Iraq Military Exercise (I can't call that a war.) is a clear example of that.


What war could the US start without the usual and immediate involvement of Russia, China or Iran -- I mean, except for a war with its own citizens that is.

The US lost the Afghanistan war, is slowly losing Iraq to Iran, lost all face and is getting humiliated in Syria, failed to prevent Russia from imposing its presidential candidate, is alienating its European allies (see the Iran sanctions debacle), and is supporting the savage Saudi Arabia -- which is losing the war in Yemen.


The US won Afghanistan handily, they just didn’t know what to do with it when they had it. I think America would have been better off placing Afghanistan under a military governor for a few years while they built up the country’s institutions.


Most powers make the same false assumption that they "easily won" an insurgency/guerrilla war.

No, you didn't win anything; you temporarily held territory, you wasted enormous resources, and you were in fact beaten by goat herders in the end.

Without putting the whole country in a concentration camp and re-educating it for 20 years, and without providing an alternative economic model, a central administration in Afghanistan is irrelevant whether you run it for "a few years" or decades.


I think you’re conflating Afghanistan with Vietnam. America never fully committed to Afghanistan as they did with Vietnam. From the very outset they tried to do it on the cheap by relying heavily on the northern alliance. Subsequent to the invasion, America was far more interested in pursuing the Iraq war, so Afghanistan quickly became a backwater.

This meant that America became heavily reliant on local warlords (some of whom had dealings with the Taliban) to ensure security and maintain order. This undermined the government they were trying to build in Kabul and contributed to a culture of corruption. None of this endeared the common Afghan to their newly formed institutions. The Taliban exploited these weaknesses with classic insurgency tactics and gradually took territory from the weak central authorities.

This was all entirely avoidable, America just didn’t stay focused on the mission.


IMHO, you should replace RF with Turkey. RF was unable to grab Ukraine or Georgia, which were completely unprepared for war.


RF completely achieved its objectives in Ukraine, Georgia and Syria, without putting on a "we have won" show.

Sure, there are sanctions, because Russia actually annexed Crimea.

Russia knows very well that territory grabbing in itself is pointless in the 21st century; Russia grabbed just enough territory to nullify any chance Ukraine or Georgia will join NATO.

Don't get me wrong, as an eastern European, I hate what Russia stands for, but militarily and geostrategically, they know very well what they are doing, because they cannot afford not to, unlike the USA.


When's the last time we didn't lose every violent international confrontation we were involved in? Also, yeah, the rest of that is both incredibly depressing, and of such a high plausibility that I can't see another way of it turning out. Pretty much like the US in Neal Stephenson's Snow Crash.


Idk the US “won” in Iraq very quickly (toppled the head of state, dissolved the official military). It was the occupation and rebuilding efforts afterwards that it failed at IMO.


The US victory was a textbook 19th century victory, but a 21st century catastrophic failure/pyrrhic victory -- in Iraq and Afghanistan.

You cannot win a war if you fail to reach your strategic objectives -- then it's not a war, but a bar brawl you "won".


The Iraq/Afghan wars were a mess, but the strategic objectives aren't even entirely known and some of them were achieved. The Iraq war at least:

1. Secured strategic access to Iraq's oil resources.
2. Established a permanent military presence in the region.
3. Removed the de facto government.


I was talking about the rational objectives of the US as a state. If we agree that the US is a state captured by criminals whose strategic objectives are unknown or contrary to those of the US as a country, then all bets are off.

1. The US was in no danger of remaining without oil, the price of oil skyrocketed after the US invasion, and the "access" was certainly not worth the trillions of dollars that the war cost, not to the US as a whole.

2. The US already had a military presence in Saudi Arabia, and without the vicious circle of violence it itself caused, the US had no reason to maintain a permanent military presence in Iraq.

Just see the recent Soleimani debacle -- the US is one such debacle away from being kicked out of Iraq after blowing those trillions, killing hundreds of thousands of civilians, helping spawn Daesh, and causing economic devastation in Iraq and the region.


True. I don't think establishing a functional government that would enable Iraq to become a regional power was in the cards. The US wanted oil.


But the US could have just bought the oil on the open markets.

In 2008, the US imported 600,000 barrels of oil per day from Iraq -- that's when oil peaked at ~$150 -- but even if oil had stayed at $150/barrel the whole year and the US had just stolen it without paying Iraq anything, that's just 32 billion USD.

It would take the US 60 years to recoup all the money it spent on the Iraq war if it were to steal all the oil from Iraq at 2008 levels, or 380 years at today's import levels and valuations (assuming the oil was war booty -- it is not).
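
Rough sketch of that arithmetic in Python; the barrel count, price and ~$2 trillion war cost are the ballpark figures from this comment, not precise data.

    # Back-of-the-envelope "war for oil" economics, using the rough figures above.
    barrels_per_day = 600_000     # approximate 2008 US imports from Iraq
    price_per_barrel = 150.0      # 2008 peak price, USD
    war_cost = 2.0e12             # "trillions" -> assume ~$2T for this sketch

    yearly_value = barrels_per_day * price_per_barrel * 365
    print(f"oil value per year: ${yearly_value / 1e9:.0f}B")     # ~$33B
    print(f"years to recoup:    {war_cost / yearly_value:.0f}")  # ~60 years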

There is no way the war was started over oil, that would be beyond dumb.

My personal theory is it was just the industrial-military oligarchs who started the war as to forever saddle the US taxpayer.


> My personal theory is it was just the industrial-military oligarchs who started the war as to forever saddle the US taxpayer.

That, and US construction and oil companies.


The US used to have tens of thousands of troops in Taiwan. The US military formally withdrew in 1979, although it has quietly retained small numbers of US military personnel in Taiwan ever since, responsible for training and liaison with Taiwanese military forces, but not enough to be militarily significant.

A quick, unexpected, massive deployment of significant US military assets to Taiwan would put Beijing in a very difficult position. Either Beijing attacks, and starts a shooting war with the US – which would do massive damage to the Chinese economy; or else, Beijing doesn't and loses a lot of face in the process.

It would be a rather risky, high stakes gamble, but one in which the US might come out in front.


And what happens to the US (and world) economy if there's a war? And how would that war end without taking the world with it?

I think COVID has shown Chinese society/government/people are much better equipped for big shocks than the US is, so if it's a protracted thing they would have a huge advantage.


You're saying unexpected, but China has likely simulated this situation a million times and prepared a plan. I would bet China is constantly monitoring for such a deployment and would react long before the troops set foot on Taiwan. AFAIK nearby islands are already heavily militarized.

Also I'm not sure how the people of Taiwan would react to such a deployment. Neither Taiwan nor China recognizes the other as a separate country. They both claim to be the rightful government of one unified China.


IIUC Graphcore chips are very different from Intel chips, so idk if their manufacturing can be compared. Or is it that chip manufacturing does not depend on what chip is being manufactured?


When will GraphCore enter MLPerf and let the industry verify their claims?

Until then, it's vaporware. I'd refer to this comment: https://news.ycombinator.com/item?id=21530810


GraphCore has been claiming to be a generation ahead of the competition for over 3 years already.

This is easy to do when you don’t have to use any standard benchmarks (MLPerf) and do not allow anybody to verify any claims.


I read everything I could find and even contacted MS to get a demo of the Graphcore IPU, with no success, so from my perspective something smells fishy: you can rent a quantum computer for one minute, but you cannot get access to IPUs; the only available access is $10K a month.


I don’t even need access. Just need verified MLPerf results.


I'm honestly asking the question: where does the compute power go these days?

Is it big data analytics from the largest socnets?


ML is enabling tons of new sinks for compute in addition to all the old cloud and supercomputer workloads


What are the usual uses of ML right now?

I mostly think about ML for self driving vehicles but I'd like to know where else it's applied.


Recommenders, speech to text, text to speech, page ranking in search engines, fraud detection, video analytics, content filtering (from dick pics to censoring news), etc.


Training vs inference. Training data sets are expanding 3-10x every 6 months.


Getting very close to the end now. The nearest-neighbor distance between Si atoms in a crystal is 0.235 nm, so a feature with 3nm width is 12.7 Si atoms across. It is amazing that this can be done, and the physics are increasingly weird at this level of scaling.
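
The arithmetic, as a one-liner in Python (the 0.235 nm Si-Si spacing is the figure quoted above):

    # Number of silicon atoms spanning a hypothetical 3 nm feature.
    si_si_nm = 0.235        # nearest-neighbor distance in a silicon crystal, nm
    print(3.0 / si_si_nm)   # ~12.8 atoms across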


>so a feature with 3nm width

Sigh, another post... another top comment not understanding that marketing name =/= any actual feature size anywhere on the chip.

I've been reading threads like this for 10+ years now, and somehow this knowledge has still not permeated the tech culture here and on reddit.


Once upon a time (the "Dennard Scaling Era") VLSI circuit design used the same relative geometry at different "feature sizes" - all dimensions of the design (wire width, wire spacing, gate length, gate pitch, ..., etc.) scaled by the same amount from generation to generation, so it was possible to completely specify the transistor layout with a single dimension (generally called 'L'), and derive all other measurements from that as a multiple of 'L'. Transistor density was proportional to 1/L^2.

The measurement that was used originally for 'L' was the gate length.

As designs shrunk below 40nm, it became impossible to shrink _every_ dimension proportionally. In particular, for planar silicon, gate length stops shrinking around ~30nm, but other things could still shrink. This meant that transistor density could still increase, but the relative geometry of wires/gates/spacing/etc. had to change, so it was no longer possible to specify the full geometry with a single number.

But people liked the single number as a handy way of comparing processes, so marketing kept using one. The way they decided to do that was mostly to try to keep the proportionality between the transistor density of a process and 1/L^2.

To the extent a "feature size" number of a process means anything, it means "the relative transistor density of this process is equivalent to what you would get if you had used the old (>40nm) geometry, and shrunk 'L' to the specified feature size". Even that relationship has degraded in recent years - now it's more like "we calculate the new feature size as the size of the previous process divided by sqrt(2)".

Regardless, as stated in the parent, there is no single dimension of any recent process that corresponds to the '3nm' number.

There's lots of resources online that describe this, but for an overview, you could start here (describes pre- and post-Dennard scaling): http://www.eng.biu.ac.il/temanad/files/2017/02/Lecture-4-Sca...
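
A small Python sketch of the naming logic described above; the 40 nm starting point and the exact 2x-per-node steps are illustrative assumptions, not real process data.

    import math

    # Under classic scaling, density ~ 1/L^2, so dividing the node "name" by
    # sqrt(2) each generation corresponds to a nominal 2x density improvement.
    def relative_density(label_nm, reference_nm=40.0):
        return (reference_nm / label_nm) ** 2

    node = 40.0
    for _ in range(6):
        print(f"{node:5.1f} nm  ->  {relative_density(node):5.1f}x density")
        node /= math.sqrt(2)   # marketing convention: next "node" = previous / sqrt(2)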


Nice explanation. One can see an example of the scalable CMOS design rules on slides 10 and 11 of this deck.

https://inst.eecs.berkeley.edu/~cs250/fa09/lectures/lec01.pd...

(To the casual reader: Note how the dimensions all have no unit. They are measured in L as indicated by the parent.)


Thanks for that explanation.

What would be even more useful is an actual answer to the underlying question the OP seems to be making: how much further until one of the many dimensions you are talking about simply runs out of Si atoms?

In other words, however "made-up" the 3nm marketing number may be, physics limits should still dictate a lower bound for it, and the OP seems to be wondering what that is.


There isn't an answer to that. We've already hit physical scaling limits for the old style (>40nm) of transistor manufacturing. Each successive generation since then has employed new tricks to improve performance, and each jump in performance is labeled with a smaller process node size number. This can keep going so long as there are more optimizations to be found. And since not all process optimizations involve shrinking dimensions, it's not necessarily the case that we can predict from physical principles when this will end.

For example, stacked chips are increasingly being used but are fundamentally limited by heat transport. Maybe when we get into sub-1nm "sizes" the process nodes will be defined by how well they transport heat out of volumetric chip designs? Or we'll switch to twisted graphene superconductors for certain components which increases efficiency without necessarily shrinking feature sizes. Etc.

I'm just throwing those possibilities out. The point is we can't predict when scaling will ultimately end.


That's still not a straightforward question to answer, because transistor shrinks aren't just about shrinking some dimensions while others stay at their limits. Transistor geometry has changed in more fundamental ways. Beyond ~28nm, the industry switched from planar transistors to FinFETs, so now instead of gate width we have an extra dimension and have to consider stuff like fin height and fin thickness and pitch. Starting around 3nm, we'll be seeing "gate all around" transistors—GAA FETs, in the form of nanowire, nanoribbon or nanosheets.


It depends on how one defines transistor, but if the definition of voltage controlled switch is sufficient then one atom will do.

https://en.wikipedia.org/wiki/Single-atom_transistor

This has nothing to do with the mass-produced transistors (yet!); those are tens of nm across even at "3nm" or "5nm".


So how big is a modern transistor? How close to together do the transistors get?

Does this mean there’s actually a lot more room to shrink things?


TSMC's 5nm 'N5' process - the highest density process currently shipping - has a raw transistor density of 173 million per millimeter^2. (https://en.wikipedia.org/wiki/5_nm_process)

If the transistors were laid out on a square grid (they aren't - it's rectangular), each square would be 76nm on a side. This area includes the transistor itself, the contact area (to connect the transistor to wires) and the required spacing to prevent the transistors from interfering with each other.
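
A quick check of that 76 nm figure in Python, from the 173 MTr/mm^2 density quoted above:

    import math
    density_per_mm2 = 173e6                             # transistors per mm^2 (TSMC N5)
    area_per_transistor_nm2 = 1e12 / density_per_mm2    # 1 mm^2 = 1e12 nm^2
    print(math.sqrt(area_per_transistor_nm2))           # ~76 nm per side on a square grid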


> "Regardless, as stated in the parent, there is no single dimension of any recent process that corresponds to the '3nm' number."

That's interesting - are they just entirely making that up then? What's the 3nm supposed to represent?

It seems like it's one thing to pick a specific dimension length to measure even if it's not proportional to all of the others, and another to just pick one that isn't represented at all.


3nm and 5nm are just marketing names; they do not represent any geometry of the transistor. Probably the best analogy is that 3nm would be the average length of the side of a pixel. One draws a transistor, or anything else, from many pixels.

The exact details are under NDA but to get a 'very' approximate idea of the scale of things one can look at the 5nm Wikipedia page.

They list the metal pitch as 30nm in TSMC's N5 node, so in general two pieces of metal cannot be within 6 'pixels' of one another. One gets a rough guess at the distance between transistors by looking at the gate pitch (roughly 10 pixels in this case), but that measurement comes with a lot of caveats too.

Keep in mind this is when you're going out of your way to make something tiny but there are many good electrical engineering reasons to make the transistors larger still, and quite a lot of them are.

https://en.wikipedia.org/wiki/5_nm_process
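
The same 'pixel' conversion in a couple of lines of Python; the 30 nm metal pitch comes from the Wikipedia page above, and the ~50 nm gate pitch is only a rough assumption for illustration.

    pixel_nm = 5.0           # the marketing node name, treated as the "pixel" size
    metal_pitch_nm = 30.0    # N5 metal pitch, per the Wikipedia page above
    gate_pitch_nm = 50.0     # rough gate pitch, assumed for illustration
    print(metal_pitch_nm / pixel_nm)   # 6 "pixels" between metal lines
    print(gate_pitch_nm / pixel_nm)    # ~10 "pixels" between gates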



> That's interesting - are they just entirely making that up then? What's the 3nm supposed to represent?

A progression over 5nm. That is all.


That was really instructive. Thank you for sharing.


I hate when people complain about how uninformed everyone is, but don't put in even the tiniest amount of effort to help people become more informed. At the very least, include a link.


It's not fair to blame people for not knowing that "3nm" does not mean "3nm". Knowledge about measurement units is more prevalent than knowledge about marketing practices of semiconductor companies. Blame those misleading practices instead.


Exactly; one vendor's 7nm may well contain more atoms than another vendor's 10nm node. Node numbers mean nothing beyond comparisons against that same vendor's other nodes.

Yes, it's sad that node sizes are marketed in the same way clock speeds were during the GHz race, and the disparity is even greater when you try to compare nodes like for like.


Is there a dimension that can be used to compare across process tech since process node is now marketing and not gate length?


The narrowest dimension of a FinFET is roughly the process node size, within a small factor, so my comment applies. In fact, the first FinFETs were actually narrower than the node size when they were introduced.


It would be nice if we could standardize on some physical feature size so that everything is measured the same way. It's silly to have a measurement system which is calibrated differently for what should be apples-to-apples comparisons. Imagine if some car manufacturers listed the length of their car from the rear bumper to the steering wheel and others measured from the rear bumper to the front bumper.


It may not be generally true, but _it is_ true for the smallest possible feature, so it's not entirely incorrect either.


This is unfortunately prevalent on most discussion forums. In many cases this correction is downvoted and buried deep.


and you're even getting downvoted for pointing it out...


The comment is not in the gray as I write this, but one tends to get better results generally by providing more information in a positive light, rather than simply saying that something is wrong without providing anything better.

For example, if the parent comment had said, "Yes, the control over small numbers of atoms is interesting. Although transistors are going to be much larger than that, it is cool that the shrinking feature size allows [making up a "fact" here for demonstration purposes] edges of transistors to be sharper and a little closer together, so yields are higher for a given transistor density," then it would be more useful and better received.

Also, remember "today's 10000": https://xkcd.com/1053/


But keep in mind, the parent poster likely didn't mean his words in a positive light. To portray them as such would be dishonest, which is unethical, confusing to all involved, and would lead to further confusion and incoherence as the conversation progressed.


and now I'm downvoted for pointing out that it's being pointed out... will it ever end?

(it was in the grey when I commented, but I upvoted it so I think that brought it back into the black)


It’s worth pointing out that these node names are only mildly related to feature size at this point.


They are more like minimum possible feature size, not average feature size. The real metrics you want to look at are things like transistor density, power density, power use, etc.


And even those things need to be assessed in the context of a specific design.


The real limits to performance aren't Moore's law: they're heat dissipation and the speed of light. We could do tons of high quality ML with 10nm+.


I'm not sure why I was downvoted, but my comment is technically correct. (I work in ML and supercomputing; nearly all modern large ML and supercomputing systems are effectively a tradeoff between how much heat-producing compute you can fit in a space and how much latency the long cables between nodes add.)

See also: https://en.wikipedia.org/wiki/Limits_of_computation


For what an anecdote is worth, even back in the ‘90s my dad was telling me how the speed of light dictated the shape of supercomputers.


It was far worse then, as copper places serious upper limits on long-distance high-speed transport. Data transmission latency over copper tends to be 2X that of light (a physical bound, not just empirically observed), and making very long cables with copper is impractical due to signal losses. Once long fiber was cheaply available, people started to use it for the long links (imagine a toroidal mesh with wrap-around links -- the wrap-around links are very long (100+ m) and need to be fiber).

However, long fiber causes latency problems -- those wrap-around links will slow down any global reductions you need to do.
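
For a rough feel of the numbers (a sketch with assumed values, not measurements): light in fiber travels at about two-thirds of c, so a single 100 m wrap-around link adds roughly half a microsecond each way before any switching overhead.

    # One-way propagation delay over a long wrap-around link (assumed 100 m of fiber),
    # ignoring switching and serialization overheads, which add more in practice.
    c_vacuum = 3.0e8                 # m/s
    fiber_speed = 0.67 * c_vacuum    # ~2/3 c in optical fiber
    link_length_m = 100.0
    print(link_length_m / fiber_speed * 1e9, "ns one way")   # ~500 ns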


I can't help but imagine wormholes or FTL tech being first invented and used for longer distance interconnects in supercomputers. :-)


It's getting worse for AI chips and better for memory. GPU-type devices doing machine learning have all their compute silicon running flat out all the time, all emitting heat. Flash memory, at least so far, doesn't seem to have the data rate to have a heat problem. At any one time, most of the device is inactive.

It's amazing that there are 2 terabyte USB sticks for US$40.


>It's amazing that there are 2 terabyte USB sticks for US$40

There aren't. There are scam products that claim to have 2 terabytes capacity though.


Not quite $40, but 2TB USB drives do exist. Who knows what you’d store on it though!

https://www.walmart.com/ip/Kingston-DataTraveler-Ultimate-GT...


We're not that far away from that point -- here's a 1 terabyte USB for $200: https://www.amazon.com/-/es/PNY-Elite-Flash-Speeds-P-FD1TBPR...


Bad link.

Here's the 2TB USB drive I was talking about.[1] It's on Amazon, so it's probably fake.

[1] https://www.amazon.com/jing-Compatible-Computer-High-Speed-D...


When it comes to flash memory, it's best to avoid no-name brands and prices that seem too good to be true.

But considering we now have 1TB microSD cards, it's definitely feasible to make a flash drive with such capacity.


It is getting worse for Flash. There are consumer M.2 SSDs that have huge heatsinks and some even with fans!

Mark Cerny says this is a concern for the PS5, which will have user-expandable SSD storage. Unlike SATA drives, the M.2 standard doesn't define z-height, and the drives that meet the PS5 min spec are too thick right now.


The main problem there is the SSD controller interfacing between the flash memory and the CPU. Right now, the only SSD controller that supports PCIe gen4 speeds and is small enough to fit on a M.2 card is a controller made on TSMC's 28nm process. Everybody else in the industry decided to move to TSMC's 16/12nm processes before trying to ship a high-performance PCIe gen4 SSD controller. Doing really advanced error correction at 5+ GB/s takes some juice, so SSD controllers have to follow in the wake of CPUs and smartphone SoCs by moving to smaller but more expensive (at least up front) process nodes.


It's not going to happen any time soon, but I'm looking forward to "computronium" as described in sci-fi - a computing substrate optimized at the molecular level to the point that the performance becomes the function of its volume.


I wonder if we can go to even higher density. From entropy constraints, computing performance asymptotically scales with surface area [0,1]. The general bound is Bekenstein bound[2]

[0] https://en.wikipedia.org/wiki/Black_hole_thermodynamics

[1] https://en.wikipedia.org/wiki/Holographic_principle

[2] https://en.wikipedia.org/wiki/Bekenstein_bound
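
For a sense of scale, here is the Bekenstein bound evaluated in Python for a hypothetical 1 kg, 10 cm device; the mass and radius are arbitrary assumptions just to get an order of magnitude.

    import math

    # Bekenstein bound in bits: I <= 2*pi*R*E / (hbar * c * ln 2), with E = m*c^2.
    hbar = 1.054571817e-34   # J*s
    c = 2.99792458e8         # m/s
    m = 1.0                  # kg, assumed
    R = 0.1                  # m, assumed
    bits = 2 * math.pi * R * m * c**2 / (hbar * c * math.log(2))
    print(f"{bits:.2e} bits")   # ~2.6e42 bits -- astronomically beyond any real device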


The bound exists, but it's so many orders of magnitude out that for all practical purposes we don't need to worry about it.

Of course, however, heat dissipation presents similar challenges.


It'll be interesting to see how/if quantum effects will be used in future computing tech (well, aside from the whole quantum bandgaps in semiconductors thing). Effects like thermal superconductors could enable much higher heat dissipation from a given area, perhaps built with quantum dots on semiconductors (IIRC, I think some researchers tried that).


I guess if we hit the edge of feature density, we might start building things out of 3D FPGA-type designs of homogeneous modules, but because of heat, the capacity of such a design is ultimately going to be limited by its surface area, not its volume, no?


My CPU is a sphere!


Here's a helpful recent read on what the metrics mean and what semis are trying to replace them with: https://www.gwern.net/docs/cs/2020-moore.pdf


Indeed. I know there was a working 1nm node transistor out of Berkeley a couple years ago, but as far as I know that's pretty close to the limit. I'm really curious what will come after we hit that in production in ~10? years.


Single-atom transistors are a thing. I can’t even begin to guess how difficult anything like that will be to mass-produce.

https://en.wikipedia.org/wiki/Single-atom_transistor


Can you link to that?



Thanks!


Intel has a roadmap to 1.4nm



There are plenty of areas for improvement beyond pure transistor size:

- density

- heat

- resistance


Is no one interested in the stats listed on the card? 200TFlops for 300W? That is incredible power efficiency! What kind of connectivity solution is needed to feed it?


It's not that impressive. The NVIDIA A100 has better efficiency: 312 TFlops @ 400W for the same precision (FP16.32).
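
Spelled out as perf per watt, using only the figures quoted in this thread:

    graphcore_tflops_per_w = 200 / 300   # ~0.67 TFLOPS/W (claimed)
    a100_tflops_per_w = 312 / 400        # ~0.78 TFLOPS/W
    print(graphcore_tflops_per_w, a100_tflops_per_w)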


Also the A100 is cheaper, and its perf is verified by MLPerf.

GraphCore's claims are just that, claims. Some of these have already been debunked by people with access to the hardware (e.g. promised 10x speed-ups were more like 1x).


Do you have a link to those results by independent researchers? I have also been very skeptical of GraphCore’s claims (and have talked about it on reddit) but haven’t had access or heard of anyone’s experience.


Does the manufacturing of these processors require EUV?


Yes, 5 nm and below uses EUV.


Is there a good and short (5 mins max) video explaining how EUV works and how the produced chip works?


It's 21 minutes, not 5 minutes, but here's a lecture by Chris Mack on EUV from 2013. He's fantastic. (Skip around or increase playback speed if you don't have more than 5 minutes to spare.)

https://www.youtube.com/watch?v=LHyV_-9JXu4

Chips produced by EUV work the same as any other chip. And even chips that are made with EUV are only using EUV for the very finest features - they're still using plenty of 'regular' lithography for the dozens of other lithography steps.


I have way more than 5 minutes, just my attention-span is crippled.


I don't know of a video but I'll try to summarize.

How to make a chip:
1. Slice pure silicon into a wafer.
2. Apply a thin coating of a light-activated chemical which does something to the wafer (etch, build up metal, dope[1] silicon, etc.).
3. Shine light through a mask which exposes a pattern into the silicon wafer.
4. Rinse off the un-activated chemical.
5. Repeat steps 2-4 many times for each layer.
6. Slice the wafer up into rectangular chips.
7. Package the chips into a plastic, metal, or ceramic package with exposed metal contacts which can be soldered to a circuit board.

Why EUV? Extreme Ultraviolet - really small wavelength. When you're trying to expose really small structures you need a really small wavelength.

Why is EUV so hard? EUV is extremely high energy so it destroys everything, and it takes a huge amount of energy to create the light source. Also: quantum things.

[1]: https://en.wikipedia.org/wiki/Doping_(semiconductor)
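
A hedged sketch of why the shorter wavelength matters, using the Rayleigh resolution criterion CD ~ k1 * wavelength / NA; the k1 and NA values below are typical published ballpark figures, not exact process parameters.

    # Smallest printable feature (critical dimension), very roughly.
    def critical_dimension(k1, wavelength_nm, numerical_aperture):
        return k1 * wavelength_nm / numerical_aperture

    # Deep-UV immersion lithography: 193 nm light, NA ~1.35
    print(critical_dimension(0.35, 193.0, 1.35))   # ~50 nm (hence multi-patterning)
    # EUV: 13.5 nm light, NA ~0.33
    print(critical_dimension(0.35, 13.5, 0.33))    # ~14 nm in a single exposure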


Require is a very tricky word. But yes, they will be using EUV.


3nm by 2022. Is anyone in the US even considering a 3nm fab?


It’s well known that TSMC is the only company at that stage. Nobody else in the world, let alone America, is close to 3nm.


TSMC isn’t close to 3nm either... TSMC “3nm” is Intel 5nm. TSMC doesn’t get to Intel 3nm for at least 4 years.


Sure. But Intel 7nm isn’t TSMC 5nm ... and Intel isn’t at 7nm yet.


Intel 7nm is better than TSMC 5nm


Oh interesting—I wasn’t aware of that.

Apparently that’s according to Intel folks discussing it before they can produce it though. We’ll only really know when it’s ready. A lot of times, you make compromises to get something to market... so maybe even Intel 7nm won’t end up being “Intel 7nm” ;)


Intel 7nm doesn’t exist, and TSMC 5nm chips have been in production for months now.


Yes I know...


The nm number by itself doesn't really tell us anything.

Watt/TeraFlop is a much more tangible way to understand why this is cool.


Gentle reminder that 3nm is a meaningless marketing number that represents neither the transistor size nor element density, and it does not correlate in any straightforward way with other companies' Xnm processes.


>> ... it does not correlate in any way with all other Xnm production of other companies.

True. But it should be significantly better/smaller than TSMC 5nm, which is claimed to be 1.85x the density of TSMC 7nm (the node they are using now to make a chip with 59 billion transistors).

The only other players to compare to are Intel and Samsung.


It does correlate... generally the numbers reflect actual end performance relative to one another; no company's smaller node name is actually worse than another company's bigger one.


So what does it mean then?


According to Wikipedia [1], it refers to "the process' minimum feature size."

[1] https://en.wikipedia.org/wiki/Semiconductor_device_fabricati...


In many cases, like Samsung's, it does not even refer to the feature size, and just represents a generation of the tech process or some arbitrary incremental improvement.


Literally nothing more than a marketing "group name" for a family of processes, none of which have any dimension approaching 3nm.


This talks a little about how the meaning has changed over time:

https://en.wikichip.org/wiki/technology_node


Probably the size of the smallest feature the lithography can etch on a die. But an etch alone doesn't make a transistor, which means the smallest useful feature is larger.



