Microsoft CEO says up to 30% of the company's code was written by AI

24 points by pseudolus 6 months ago

maronato 6 months ago

> Microsoft CEO Satya Nadella said that 20%-30% of code inside the company’s repositories was “written by software” — meaning AI

Does it mean AI, though? Lots of lines in repositories are generated by software that isn’t AI: dependency lock files, proto files, etc.

IMO the wording is intentionally misleading.
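
As a rough sketch of why that matters, here is one way to estimate how much of a repository is machine-generated without any AI involved. The filename patterns and line counts below are made up for illustration; a real repository would need its own list, and nothing here reflects how Microsoft actually measures this.

```python
from fnmatch import fnmatch

# Hypothetical patterns for generated, non-AI files (lock files, protobuf output).
GENERATED_PATTERNS = ["package-lock.json", "yarn.lock", "Cargo.lock", "*.pb.go", "*_pb2.py"]


def is_generated(path: str) -> bool:
    """Return True if the file name matches one of the assumed patterns."""
    name = path.rsplit("/", 1)[-1]
    return any(fnmatch(name, pattern) for pattern in GENERATED_PATTERNS)


def generated_share(line_counts: dict) -> float:
    """Fraction of all lines that live in files flagged as generated."""
    total = sum(line_counts.values())
    if total == 0:
        return 0.0
    generated = sum(n for path, n in line_counts.items() if is_generated(path))
    return generated / total


# Made-up line counts per file, for illustration only.
repo = {"src/app.py": 700, "package-lock.json": 250, "api/service_pb2.py": 50}
print(f"{generated_share(repo):.0%} of lines were written by software")
```

With these invented numbers, a lock file and one protobuf stub already account for 30% of the lines, with no AI anywhere in the picture.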

karmakaze 6 months ago

Lots of leeway there. Almost 100% of my source is written by my text editor.

snkzxbs 6 months ago

I don’t believe that datum for a second considering how big their existing code bases are.

pier25 6 months ago

I don't believe it either, but they probably meant 30% of the code committed, not 30% of their total codebase.
- gorjusborg 6 months ago
  
  By trying to understand it, I believe you are expending far more thought and effort than went into making the statement.
dessimus 6 months ago

If any line of any codebase was written by AI, then up to 30% can be true, in the way that my ISP will claim ~900 Mbps still qualifies as up to 2 Gbps.
omneity 6 months ago

It doesn’t have to be contiguous AI-written chunks. This could also mean X% accepted AI suggestions on Y% of the codebase, where X*Y < 30%.
AI suggestions can also be as simple as autocomplete and still be counted for the sake of engagement metrics.
Oh, and in enterprise settings, especially MS shops, GitHub Copilot is being pushed everywhere, so (forceful) adoption rates are much higher than the market average.
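
To make the X*Y arithmetic concrete, a minimal sketch with made-up numbers. The 50% acceptance rate and 40% coverage are assumptions chosen for illustration, not reported figures, and `ai_share` is a hypothetical helper.

```python
def ai_share(acceptance_rate: float, coverage: float) -> float:
    """Share of the codebase attributable to accepted AI suggestions.

    acceptance_rate: fraction of code in the touched areas that came from
                     accepted suggestions (X), assumed to be in [0, 1].
    coverage:        fraction of the codebase those areas represent (Y),
                     assumed to be in [0, 1].
    """
    if not (0.0 <= acceptance_rate <= 1.0 and 0.0 <= coverage <= 1.0):
        raise ValueError("rates must be between 0 and 1")
    return acceptance_rate * coverage


# Hypothetical inputs: half the code is accepted suggestions in 40% of the codebase.
share = ai_share(0.5, 0.4)
print(f"{share:.0%}")  # 20%
```

So even a high acceptance rate lands under 30% once it only applies to part of the codebase, and the suggestions being counted may be nothing more than autocompleted lines.
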
Hydraulix989 6 months ago

Right, they're not going to want to risk breaking legacy application behavior.

mgkimsal 6 months ago

Really, really, really depends on what sort of code is being 'written'. 20 years ago, IDEs would automatically create boilerplate getters and setters. In large projects that's a non-trivial amount of code. IDEs can autocomplete stuff already. For most of the folks I know doing non-trivial projects, AI tools are... useful autocompletes, but not much more. So... 25% of your code was done by AI, but is it the hard nitty-gritty stuff? The value prop of your whole company? Or is it just lots of boilerplate that is necessary because of all the abstractions we have at our disposal today (or... all the abstractions that are required to do anything 'modern', to cast it in a negative light)?

alganet 6 months ago

Percentages are misleading.

How many lines in a diff are actually relevant code? Anyone who does reviews knows the answer.

That is one of the reasons why lean, terse languages are often better to review.

We can guess from those companies' preferred coding styles and technologies whether their codebases are lean and terse or full of straw. And that should give us an estimate.

Of course, I could be wrong. They could be doing this measurement after removing irrelevant changes.

The better choice would be not to publish those sorts of claims unless there is a clear methodology that explains how the number was arrived at.

sublinear 6 months ago

I swear it's as if the entire last 5 years of AI hype and sales were fueled by drugs.

SV_BubbleTime 6 months ago

The last few years of AI have been the longest decade.

dustingetz 6 months ago

this was reported as actually just IDE tab completion back when google claimed this stat last year

bananapub 6 months ago

google's internal and extremely sophisticated LLM completion thing is driven by IDE tab completion
- UncleMeat 6 months ago
  
  IMO, the key thing that software engineers want to know with these numbers is "is there still an engineer involved?" In my mind, LLM-powered autocomplete that generates a lot of code is just totally different from "PM says they want this feature and the AI generates the entire thing from scratch", in that one amplifies the capabilities of an engineer while the other replaces them.
  - dustingetz 6 months ago
    
    who said anything about LLMs? Did Pichai specify that?
- dustingetz 6 months ago
  
  it also drives my gmail sentence autocomplete. it does not mean “30% of my email is written by AI”. It does help me type faster though. Reframing the one as the other is, imo, securities fraud. (I will asterisk that YC startups vibecoding their product is real, but that’s, like, 10^10 lower LOC scale than “all of Google”)

techpineapple 6 months ago

> Of course, it’s unclear how exactly Microsoft and Google are measuring what’s AI-generated versus not, so these figures are best taken with a grain of salt.

This does seem to me to be the key question, is anyone transparent about this? If not, why not?

hnav 6 months ago

The perception of having fallen behind in AI adversely impacts your stock price and the amount of capital you can marshal to actually compete in AI. What I think is actually happening industry-wide is that any sort of "intelligence" in software is slowly being rebranded as AI.
cratermoon 6 months ago

Why not? They want companies to buy their AI-enabled slop.
- techpineapple 6 months ago
  
  Wouldn’t being transparent about how effective it was at writing code be a good sales tool?
  - rsynnott 6 months ago
    
    Only if it's effective at writing code, which it of course is not.
    Any time you see companies refusing to even vaguely define what metrics like this mean (or, for that matter, using non-standard metrics, like disclosing weekly active users but not monthly), it's generally a very strong signal that they're not interested in being transparent because the truth is, ah, not what they would like it to be.
  - sublinear 6 months ago
    
    Yes, and that's exactly why they're not transparent about it.

lotsoweiners 6 months ago

Technically 0.5% qualifies as up to 30% to the marketing crowd.

andsoitis 6 months ago

> Satya Nadella said that 20%-30% of code inside the company’s repositories was “written by software”

and

> The Microsoft CEO said the company was seeing mixed results in AI-generated code across different languages, with more progress in Python and less in C++.

So the CEO of Microsoft is saying that 20 - 30% of their code is being produced by computer systems that write poor code?

A4ET8a8uTh0_v2 6 months ago

Honestly, it is disappointing. Even joke jobs are taken away by LLMs. For shame.

Ekaros 6 months ago

Generated by software -> generated by AI seems a huge logical leap. Then again, maybe it can be, for a given meaning of "AI".

Not that 30% of code being automatically generated from templates or in some algorithmic way seems unbelievable. There is likely a lot of code that could be generated by other code, and it might even be a reasonable choice.

proc0 6 months ago

Explains a lot.

bananapub 6 months ago

the actual quote (https://www.nbclosangeles.com/news/business/money-report/sat...):

> "I'd say maybe 20%, 30% of the code that is inside of our repos today and some of our projects are probably all written by software," Nadella said during a conversation before a live audience with Meta CEO Mark Zuckerberg.

which is clearly untrue, one assumes he meant "20%, 30% written since 2023 was partially generated by an LLM operated by a developer", but that doesn't sell stock.

mech422 6 months ago

So was it AI or code gen tools (interface generators, scaffolding, etc.)?

methuselah_in 6 months ago

Horrible. Why would you not let people on earth eat, and take away jobs in the name of saving? Let AI be helpful as a tool to help people, not just to take away jobs.

CyberMacGyver 6 months ago

Given how eager these models[0] are to over-engineer a solution, it’s not surprising. I would imagine the real number is significantly lower.

[0] Claude 3.7 in my recent experience

masteruvpuppetz 6 months ago

My current VBA code is all generated by ChatGPT/Claude/DeepSeek.

There is no use in writing VBA these days :@

whobre 6 months ago

And 75% of my code was “written” by copy/pasting…

williamtrask 6 months ago

If this isn’t jumping the shark it’s darn close.

returnInfinity 6 months ago

Doesn't mean 30% more productivity

strix_varius 6 months ago

...as anyone who's used MS Teams can attest.

lp0_on_fire 6 months ago

Good news, then. If the last update to Outlook I received is any indicator, they're coming for that next.