Most of the bullshitters will tip their hand pretty early that they're just hype men for AI. Right off the bat, the fact that AI is disruptive and transforming society is apparently self-evident because they never cite a single premise or event to back this up. In the quote above, the phrase "if you've played" stuck out to me. Yes if you play around with them a little it's easy to believe they're really good at so many things. When you stringently evaluate them, you begin to see they make a lot of mistakes and perform inconsistently on even trivial tasks.
Yes, they’re very good coders; no, they aren’t perfect. Sometimes they make trivial mistakes or hallucinate, and other times they have unique insights about difficult problems or one-shot a lot of complex tasks.
If you don’t see the value of AI even at this point, you’re either lacking imagination and/or just too stuck in your own ways.
For me, AI tools shine when I know enough about a topic to quickly error check, but not enough where I can code fluently without documentation. I'm sure it will get more useful over time, but that's where it's been for me for the last year or so.
It seems that way if you judge them the same way you would human coders, but they’re different. They might be able to do things that veteran coders can’t without spending days on it, and fail at things that beginners can do in half an hour.
A car might not be able to traverse difficult terrain as well as a horse, but it doesn’t mean the car is not a good mode of transportation.
The sibling comment says it better - it's about predicability of mistakes, and the effort and knowledge necessary to correct them.
Maybe the next generation of models will solve these problems, who knows.
So if you drive well and can control this "car", you are able to fix that and benefit from the fast "car" a lot.
> or a car that randomly refuses to go on certain roads
This doesn't seem like a good comparison. AI will do anything, it may just be wrong.
I'm fine with ditching the car analogy though.
Anecdotally, I think this behavior is undesirable for most commercial LLM use cases. I have several friends that have complained about Gemini’s “back talking” and prefer ChatGPT’s relative sycophancy.
That doesn’t mean they are unpredictably bad, it means assuming that they behave like humans is a bad way of predicting their behaviour. “Unlike humans” and “unpredictable” are not the same thing. If you spend time working with them, you get a better sense of what they are good and bad at and get better at predicting their behaviour.
Also, their eagerness to produce. A good employee (senior or not!), knows when they're at the end of their wits and say so. In areas where LLMs are knowledgeable, but not knowledgeable enough, they'll still produce. They're stuck at unconscious incompetence, kind of like a systematic Dunning-Kruger effect. To solve that, we can either increase competence (moving out the area where this effect is noticeable), or maybe introduce some kind of feedback, self reflection (one could say, some kind of conciousness [0]).
[0] inspired by "Goedel, Escher, Bach"
If you tried to replace a programmer with whatever LLM it would completely flop. Just write a script to fetch Jira tickets and let it rip - it won't work well. Because it doesn't know that maybe the Jira ticket is stupid, or that if they implement this change it requires insight from Team X, or that this bug is really intended behavior, etc.
Being a programmer is much more than, like, making a neat regular expression or generating slightly customized boilerplate. As it stands, these are assistants in which programmers guide and handhold them the whole way. They are not coders.
Making those decisions and communicating is a big chunk of being a programmer.
> bu-but you can waste your day writing prompts to get around the fact that LLMs are brainless!
Yes, and?
But this line of "its only going to get even better" is a mantra that's endlessly expounded on, a brain worm even. It's never backed up with any observable evidence. It's marketing that they're tricking people into repeating.
This one in particular is a hearty kek.
Technology that any of us 5 years ago would've thought was a hundred years away and all you read is moving goalposts from ornery developers in denial.
Silicon Valley is cooked.
And therein lies the risk: research labs may become wholly dependent on companies whose agendas are fundamentally commercial. In exchange for access to compute and frontier models, labs may cede control over data, methods, and IP—letting private firms quietly extract value from publicly funded research. What begins as partnership can end in capture.
They already are. Who provides their computers and operating systems? Who provides their HR software? Who provides their expensive lab equipment?
Companies are not in some separate realm. They are how our society produces goods and services, including the most essential ones.
[0] https://iea.org.uk/wp-content/uploads/2021/12/mister-gotcha-...
This is the intention of tech transfer. To have private-sector entities commercialize the R&D.
What is the alternative? National labs and universities can't commercialize in the same way, including due to legal restrictions at the state and sometimes federal level.
As long as the process and tech transfer agreements are fair and transparent -- and not concentrated in say OpenAI or with underhanded kickbacks to government -- commercialization will benefit productive applications of AI. All the software we're using right now to communicate sits on top of previous, successful, federally-funded tech transfer efforts which were then commercialized. This is how the system works, how we got to this level.
I think that's the crux of the guy you're responding to's point. He does not believe it will be done fairly and transparently, because these AI corporations will have broad control over the technology.
Having been in this world though, I didn't see a reluctance in federal labs to work with capable entrepreneurs with companies at any level of scale. From startup to OpenAI to defense primes, they're open to all. So part of the challenge here is simply engaging capable entrepreneurs to go license tech from federal labs, and go create competitors for the greedy VC-funded or defense prime incumbents.
My reluctance is when we talk about fraud, waste, and corruptions in government, this is where it happens.
The DoD's budget isn't $1T because they are spending $900B on the troops. It's $1T because $900B of that ends up in the hands of the likes of Lockhead martin and Raytheon to build equipment we don't need.
I frankly do not trust "entrepreneurs" to not be greedy pigs willing to 100x the cost of anything and everything. There are nearly no checks in place to stop that from happening.
1.https://dsm.forecastinternational.com/2025/04/18/tariffs-tru... 2. https://www.xatakaon.com/materials/u-s-f-35-fighter-jets-and... 3. https://www.forbes.com/sites/davidjeans/2025/04/16/silicon-v... 4. https://www.19fortyfive.com/2025/03/the-navys-ford-class-120...
You can hope that a defense company is doing the right things in terms of supply chain attacks, but that's a pretty lucrative corner to cut. They'd not even need to cut it all the time to reap benefits.
The only other alternative is frequent audits of the defense company which is expensive and wouldn't necessarily solve the problem.
Reasonably there should be a two way exchange? It might be okay for companies to piggyback on research funds if that also means that more research insight enters public knowledge.
There’s zero acknowledgment or appreciation of public infra and research.
> “Why don’t we just let private industry build these giant engines for progress and science, and we’ll all reap the benefits?” The problem is that if we’re not careful, it could lead us to a very different country than the one we’ve been in.
This isn't about commercialization, it's about control. When access to frontier models and SOTA compute is gated by private interests, academics (and the public) risk getting locked out. Not because of merit, but because their work doesn't align with corporate priorities.
Sell assets like government real estate to themselves at super cheap rates and then set up as many dependencies as they can where the government has to buy services from them because they have nowhere else to turn.
To give an example this missile dome bullshit they are talking about building which is a terrible idea for a bunch of reasons.. but there is talks at the moment of having this run by a private company who will sell it as a subscription service. So in this scenario the US military can’t actually fire the missiles without the explicit permission of a private company.
This AI thing is the same scam.
If LLM/AI is critical to national security, then it should be funded solely via the Dep of Defense budget, with no IP or copy right derivatives allowed.
That said, I'm not very confident such a situation would happen in reality. I'm not confident current industry leaders can see past a quarter and nearly certain they can't see past 4. Current behavior already indicates that they are unwilling to maximize their own profits. A rising tide lifts all ships, but many will forgo the benefit this gives them to set out to explore for new and greater riches and instead are only able to envy the other ships rising with them. It can be easy to lose sight of what you have if you are too busy looking at others.
[0] Simplified example illustrated by Iterative Prisoner's Dilemma: https://www.youtube.com/watch?v=Ur3Vf_ibHD0
[0.1] Can explain more if needed but I don't think this is hard to understand.
I want to interrogate AI optimist type people because even if AI is completely safe and harmless I literally see only downsides.
Is your perception that living in theorized extreme comfort correlates to "reward"?
It's mostly because the actual stated and actualized goal of real AI is clearly bad.
It's like if you approached me and said "I'm going to erase meaning from your life. You will want for nothing. The entire paradigm of reality will be changed: your entire evolutionary struggle, made meaningless. You will want for nothing: "be" nothing. Also this might potentially kill you or some rich person or more likely nation-state could enslave you."
The actual stated goals seem negative to me. I'm not very interested in erasing and surpassing human achievement in every way. It seems inherently bad. I don't think that's an absurd perspective.
I think the disconnect here is asked in "what is the purpose of life" and I don't think any reasonable interpretation of that is "be obscenely comfortable".
I'm kind of disappointed that their dashboard has been moved or offline or something for the past few years. https://b2510207.smushcdn.com/2510207/wp-content/uploads/202... is what it used to look like.
> The Lab's science and technology digital magazine presents the most significant research initiatives and accomplishments from national-security-related programs as well as projects that advance the frontiers of basic science. Our name is an homage to the Lab's historic role in the nation's service: During World War II, all that the outside world knew of the top-secret laboratory was the mailing address - P.O. Box 1663, Santa Fe, New Mexico.
https://researchlibrary.lanl.gov/about-the-library/publicati...
To the extent that further improvements to AI are either snake oil or just hard to monopolise on, doing everything else first is of course the best idea.
Even though I'm more on the side of finding these things impressive, it's not at all clear to me that the people funding their development will be able to monopolise the return on the investment - https://en.wikipedia.org/wiki/Egg_of_Columbus etc.
Also: the way the biggest enthusiasts are talking about the sectoral growth and corresponding electrical power requirements… well, I agree with the maths for the power if I assume the growth, but they're economically unrealistic on the timescales they talk about, and that's despite that renewables are the fastest %-per-year-growth power sector and could plausibly double global electrical production by the early 2030s.
The major question is: at what point will unaddressed climate change nullify these economic gains and make the fact that anyone worried about them feel silly in retrospect?
Or put another way, will we even have the chance collectively enjoy the benefits of that work?
- To what extent AI will actually be helpful in solving the climate crisis?
- To what extent the power generation growth fueled by AI will be critical to solving the climate crisis, and conversely, how badly we'll be screwed without it?
"Degrowth" is not an option. It hasn't been for a long time now. We can't just "social solution" our way out this problem anymore.
Electricity + home heating + cars is not 100%, but cutting emissions in half means you double the time before reaching any given threshold. For many problems the last 10% is the most challenging but we get 10x as long to solve it and 10x as long to implement mitigation strategies.
That’s what makes climate change critical, the next year is more important than what happens 10 or 20 years from now.
However, doing so requires ramping up electricity production and storage.
Add heating for buildings + hot water + industrial processes that can use electricity instead of fossil fuels alongside indirect effects like methane leaks from pipelines and drumroll.
We can get to ~50% reduction while saving money over using fossil fuels.
Too many AI accelerationists are treating these questions as foregone conclusions, which seems like an enormously dangerous bet without a clearer pathway to a truly beneficial outcome.
It may very well be that some form of AI (which form? hard to say - probably not LLMs) are a part of the solution. It may just as well be that they are not. But when building software, the age old advice to “start with the problem, not the solution” comes to mind.
The number of engineers I’ve worked with over the years (I’ve been one of them) who are convinced that the feature they’re building is the right thing to build, only to realize later that it doesn’t solve a problem people care about…is a very large number.
Regarding degrowth, I’m not advocating for it. With that said, that will be the unwanted result forced on us by the realities of the environment if we can’t put a lid on the climate issue.
This is not helpful. There are many reasons degrowth won't generally help humanity, but the benefits are particularly aplicable to western nations and their diplomatic relations. Certainly many western nations can bear degrowth without significant loss in quality of life. The wealthy just gotta take a significant cut to their waistlines.
> We can't just "social solution" our way out this problem anymore.
This certainly seems to be the liberal solution. Short of evicting them from our society what better choices do we have?
The people qualified to fix global warming aren't the same people qualified to work on ML.
And it's corollary: something being in the news or social media means everyone else has stopped working on other problems and is now solely working on whatever that headline's words say.
I've worked with hundreds of Data Scientists and every one had the ability to work on different problem areas. And so they were working on models to optimise truck rollouts or when to increase compute for complex jobs.
If as a society we placed an increased emphasis on efficiency and power consumption we would see a lot more models being used in those areas.
The problem is we don't want to do it.
I don’t understand this line of reasoning - it’s like saying “you’re not allowed steam engines until you drain all of your mines”. It’s moralistic, rather than pragmatic.
I think it's safe to call it a rhetorical fallacy because its underlying premise implies humanity can somehow only focus on a very finite number of things at once and that funds directed to 'your thing' would somehow otherwise be directed to 'their thing' which is even more absurd.
2. Inevitable AI winter
3. Keep running the plants, clean energy achieved, stop burning coal, global warming solved
1. Plan to build atomic power plants sufficient to supply electricity needs for projected future AI megaprojects.
2. Build the AI megaprojects first, because of fallacies related to sunk costs, urgency, and unwillingness to let the pyramid scheme collapse.
3. Never complete the nuclear plants, never get the pot of gold at the end of the AI rainbow, and pump a couple hundred million tons of CO2 into the atmosphere in the mean time.
These scales use energy usage as the measure of progress which you are saying is proof that reducing energy use is reducing progress.
I'd love for individuals to set their own limit, ideally one that pushes them a bit to step out of the endless chase for comfort and convenience that we're all stuck in. I definitely don't want a centrally planned and enforced limit though, that's never going to achieve the goals people would actually want.
Our current use of electricity per capita, at least in the west, is ridiculously unsustainable if we aren't willing to keep abusing fossil fuels as an efficient store and source of energy.
I'm a for reducing our impact, but we have to stop kidding ourselves with the pipe dream that we just need the right source of energy to be able to grow our energy use per capita indefinitely.
It can equally be argued that efficiency of use of power is a bellwether of societal advancement, since wasting it will always be easier than making efficient use of it.
Case in point, few people realize that humanity reached 1.9 on the Kardashev/Sagan scale (EG, we output more power than the entire Trappist star that exoplanet enthusiasts have been so excited to colonize) for 39 brief nanoseconds on October 30, 1961. But that certainly didn't make us an entire Kardashev rung more civilized for the same brief moment as we erased another huge portion of our natural environment.
At best its a needle in a haystack approach, and one that seems to toss out methodical, reasoned science in favor of a blunderbuss.
The absolute dollar value might seem high, because we're working with the budget of not just a country but the wealthiest country, but as a percentage it is quite low. You can certainly pull funds from other areas too, like the military, which also greatly benefit from such research endeavors.
Even if these were exclusively non-weapons and non-military based technologies being developed it'd be naive to not recognize that the best military defense is to stop a war before it happens. That comes through many avenues, but notably the development of new technologies, especially those that benefit people as a whole (e.g. medicine or see the Taiwan strategy). But even then, it would also be naive to think that the same technology couldn't be adapted to military uses. Anything can be a weapon if you use it wrong enough.
But note that we're also seeing a reduction in federal research funding. We're also seeing less fundamental research and these types of problems need a strong pipeline through the classic TRL scale[0]. I think you'll find some of that discussion over in yesterday's thread about Bell Labs. The pipeline doesn't work if you don't take risks and are rushing. You need a fire under your ass but too hot and you get burned. It's easy to be myopic in today's settings, and that's not a great idea for an organization who needs to have an outlook in terms of decades and centuries (i.e. government) as opposed to the next election cycle or next quarterly earnings report.
Mind you, we've done these things before. Both the Space Race and Manhattan Project. At the height of the Space Race NASA's budget was over 4.41% of the federal budget[2]. I'm not sure what percent the Manhattan Project's budget was, but it is very clear that this is A LOT cheaper than what actual war costs[3]. We're talking about less than a month of war costs. Remember, we spent over a $750bn over in Iraq[4]. The question is not if we have the money, but what we want to spend it on. Personally I'd rather stuff like this than bombing people. Frankly, you can eat the cake too, as it makes it cheaper to bomb people as well...
[0] https://en.wikipedia.org/wiki/Technology_readiness_level
[1] https://news.ycombinator.com/item?id=43957010
[2] https://en.wikipedia.org/wiki/Budget_of_NASA
[3] https://en.wikipedia.org/wiki/Manhattan_Project#Cost
[4] https://en.wikipedia.org/wiki/Financial_cost_of_the_Iraq_War
what are some examples of 'hardly' ?
I assume "AI" in contemporary articles, especially as it pertains to investments, means "Generative AI, especially or exclusively LLMs."
> Clearly computer networking is worthy of public investment, but given the capture of this administration by military industrial interests, how can we be sure that public networking funding isn't just handouts to the president's cronies?
Our country's tight relationship between the government, military, academia, and industrial has paid off repeatedly, even if it has some graft.
And discussed on HN: https://news.ycombinator.com/item?id=43765207
This does feel like a step change in the rate at which modern AI technologies and programs are being pushed out in their PR.
it was unpleasant.
this whole "benchmarks" thing is laughable. I've been using Gemini all week to do code assist, review patches, etc. Such impressive text, lists of bullets, suggestions, etc., but then at the same time it makes tons of mistakes, which you then call it on, and it predictably is like "oh sorry! of course!" yes of COURSE. because all it does is guess what word is most likely to come after the previous word. is there a "benchmark" for "doesn't hallucinate made up BS?" because humans can do very well on such a benchmark.
In any case, if you want to take a stab at it, feel free to go ahead and start a company whereby you as the owner would assign the CEO responsibilities to an AI.