Amazon and AWS Developers May Not Want To Invite Their CEOs To Java Code Reviews

Posted by msmash on Monday August 26, 2024 @12:01PM from the closer-look dept.

theodp writes: Typos happen to the best of us, but spelling still counts when it comes to software development. So, it's kind of surprising to see that both Amazon CEO Andy Jassy and former AWS CEO Adam Selipsky failed to notice an embarrassing typo in a demo video they offered to their millions of followers on social media as evidence of Amazon Q AI's Java upgrade capabilities, which Amazon has been trumpeting for months in SEC filings, shareholder communication, and Amazon's latest earnings call with Wall Street analysts.

Just 37 seconds into the demo of the software that Amazon says saved it 4,500 developer-years of work and provided an additional $260M in annualized efficiency gains, Amazon Q kicks off the Java upgrade conversation by saying, "I can help you upgrade your Jave [sic] 8 and 11 codebases to Java 17." The embarrassing misspelling did prompt Twitter user @archo5dev to alert Jassy to the typo, but there's been no response yet from Jassy, who boasted that Amazon developers were unable to find any mistakes in Q's work in "79% of the auto-generated code reviews."

It's probably worth noting that both Jassy and Selipsky opted to showcase a drop-dead simple demo of Amazon Q Code Transformation rather than some of the lengthier and less-magical demos of the product.

This discussion has been archived. No new comments can be posted.

What's the big deal? (Score:5, Funny)

by smooth wombat ( 796938 ) writes: on Monday August 26, 2024 @12:21PM (#64736136) Journal

Developers misspell words all the time. In fact, trying to read what they write is like trying to read another language.

- Re: (Score:2)
  
  by gweihir ( 88907 ) writes:
  
  The big deal is CEOs being too dumb to have some experts review important slides.
  - Re: (Score:2)
    
    by chill ( 34294 ) writes:
    
    Probably a Harfurd graduate.
    - Re: (Score:2)
      
      by account_deleted ( 4530225 ) writes:
      
      Comment removed based on user account deletion
      - Re: (Score:2)
        
        by ShanghaiBill ( 739463 ) writes:
        
        If you're from MIT, you use the self-checkout to avoid talking to a human.
        
        Re: (Score:2)
        
        by account_deleted ( 4530225 ) writes:
        
        Comment removed based on user account deletion
        
        Re: (Score:2)
        
        by gweihir ( 88907 ) writes:
        
        I do not see the point of it. All it does is give them my shopping history at a laughable discount. And it is slower in addition.
    - Re: (Score:2)
      
      by account_deleted ( 4530225 ) writes:
      
      Comment removed based on user account deletion
      - Re: (Score:2)
        
        by gweihir ( 88907 ) writes:
        
        I have done that as well. But it was on a transatlantic flight and I already had a sketch.
- Re: (Score:2)
  
  by Murdoch5 ( 1563847 ) writes:
  
  I tend to agree, it's a simple, funny, almost cute mistake. The bigger issue is if it can't get the spelling of Java correct, then how can you trust it with more complex code?
  
  Would I bug a developer about it? No, probably not, but this isn't a human developer, it's a system that needs to be better, more efficient, and correct. If we're going to trust AI, it can't make simple mistakes like this. Yet another AI system that has to go back to the drawing board / training.
  - Re: (Score:2)
    
    by account_deleted ( 4530225 ) writes:
    
    Comment removed based on user account deletion
  - Re: What's the big deal? (Score:2)
    
    by Bodrius ( 191265 ) writes:
    
    The worst part is that this is precisely the sort of "simple, but repetitive and mind-numbingly boring" review task that is error-prone for humans, and that we have decades of experience handling with traditional software.
    This is less informative about how good the LLM is at doing LLM-appropriate things. But if you can't be bothered to do anything but wrap the model as a black box, I cannot be bothered to use and test that output.
- Re: (Score:2)
  
  by account_deleted ( 4530225 ) writes:
  
  Comment removed based on user account deletion
- Re: (Score:2)
  
  by StormReaver ( 59959 ) writes:
  
  The big deal is that this illustrates that AI is not intelligent, but too many people have been fooled into thinking it is. The software simply did a statistical analysis of its data set, encountered "Jave 8" and "Jave 11" enough times to pass its heuristics tests, and included them for no other reason than they satisfied an equation or an inequality. Relying on these systems to be accurate is extremely foolish, but all too many people in high places haven't realized this yet.
  This is the state of AI, and it'
  - Re: (Score:2)
    
    by olmsfam ( 1399493 ) writes:
    
    The big deal is that this illustrates that AI is not intelligent, but too many people have been fooled into thinking it is. The software simply did a statistical analysis of its data set, encountered "Jave 8" and "Jave 11" enough times to pass its heuristics tests, and included them for no other reason than they satisfied an equation or an inequality. Relying on these systems to be accurate is extremely foolish, but all too many people in high places haven't realized this yet.
    You must not work with humans much. Humans are not intelligent. Their brains just encountered more "statistical analysis of its data set"s. Give it 20 years to cook and it will appear more intelligent than "human machine" learning models, and be infinitely cloneable.
Frist (Score:2)

by BigEddieD ( 9191421 ) writes:

Lol, whoever signed off on that particular video, and possibly whoever sold Jassy on using that specific video (if it wasn't his own brilliant idea), are going to be having fun polishing up their CVs and clearing out their desks. After something like this, even if they don't find a reason to just fire your ass, they'll forevermore look at you as the asshole that made them look stupid in front of the plebs, and that, my friends, is a crime of unfathomable depth and unrelenting menace in the eyes of the average C
Strange and not strange. (Score:2)

by fuzzyfuzzyfungus ( 1223518 ) writes:

It seems bizarre and inexplicable (unless there really are just that many true believers in the approval path) that someone's carefully stage-managed hype-demo would go out with an obvious spelling mistake; seems like a situation where it would have cost a pittance in both relative and absolute terms to have some junior minion go over it for that sort of thing; but it seems much less surprising that a code upgrade tool would be really weak on spell check. Many, probably most of the widely used ones, programm
- Re: (Score:2)
  
  by jd ( 1658 ) writes:
  
  Microsoft has had live demos crash, and Apple have faked "live demos" of products they'd not built yet.
  The public get a good giggle, because the public don't actually think far enough ahead to realise that these sorts of failures are because the products being sold are defective.
  - Comment removed (Score:4, Insightful)
    
    by account_deleted ( 4530225 ) writes: on Monday August 26, 2024 @12:42PM (#64736226)
    
    Comment removed based on user account deletion
    
- Re: (Score:2)
  
  by Cyberax ( 705495 ) writes:
  
  It seems bizarre and inexplicable (unless there really are just that many true believers in the approval path) that someone's carefully stage-managed hype-demo would go out with an obvious spelling mistake
  Honestly, this typo makes it more believable. It really looks like an actual example of code, not a stage-managed fake demo.
- Re: (Score:2)
  
  by account_deleted ( 4530225 ) writes:
  
  Comment removed based on user account deletion
Scary (Score:3)

by jd ( 1658 ) writes: <imipak@nOsPaM.yahoo.com> on Monday August 26, 2024 @12:31PM (#64736178) Homepage Journal

Anyone who thinks that such tools are anything more than gimmicks at this stage, when we know that they tend to leave massive security holes, is waving a red flag at the black hat community. And I don't need to tell anyone here that Amazon is vulnerable in three distinct areas - their online shopping, their automated warehouses, and their cloud.
I'm not saying it's inevitable, by any means, but if Russia or North Korea successfully damage the credibility of any one of those three, the impact across the economy won't be insignificant.
They're highly vulnerable targets and this is an exceptionally dangerous time (what with a war in and around Russia and an election in the US).
Now is when they should be doing a Manhattan Project Meets OpenBSD Strategy and nailing every last byte firmly to the ground.
But, no, they want to show off Nice Shiny Toys that can't actually work.

Slow day. (Score:5, Insightful)

by SmaryJerry ( 2759091 ) writes: on Monday August 26, 2024 @12:41PM (#64736222)

Typos are news now?

- Re: (Score:2)
  
  by Tony Isaac ( 1301187 ) writes:
  
  It's news when it's about a tool that Amazon bragged was so good that developers shipped "79% of the auto-generated code reviews without any additional changes." Maybe the reviewers didn't notice when things were misspelled?
Not wrong (Score:2)

by ebcdic ( 39948 ) writes:

It's a perfectly plausible plural of Java.
- Re: (Score:2)
  
  by Chris Mattern ( 191822 ) writes:
  
  "It's a perfectly plausible plural of Java."
  In what language?
  Provide other examples of a word ending in -a that is pluralized by replacing the a with an e, please.
  - Re: Not wrong (Score:2)
    
    by BadgerStork ( 7656678 ) writes:
    
    English doesn't need citations and prior art :)
    But java plural would perhaps be javae?
    Person -> People ? Hmmm...
    - Re: (Score:2)
      
      by hughbar ( 579555 ) writes:
      
      I run the Elvis races every year: https://elvisraces.club/ [elvisraces.club] and always refer to them as Elvae. Some v. rude people wince.
      - Re: (Score:2)
        
        by Chris Mattern ( 191822 ) writes:
        
        *Adding* an e can kinda work; first declension Latin nouns work that way. *Replacing* the a with an e, not so much.
        "Elvae"? I've always heard "Elvii". Neither one really works; it really should be "Elves", with the second e long, not silent.
Prepare for a tsunami... (Score:2)

by MpVpRb ( 1423381 ) writes:

...of awful, half-baked AI crap as CEOs rush to jump on the hype wagon
I'm optimistic that AI will eventually be useful, but the first wave of implementations will suck mightily
Oh, stewardess! (Score:4, Funny)

by jfdavis668 ( 1414919 ) writes: on Monday August 26, 2024 @01:01PM (#64736286)

I speak Jave.

You Don't Understand (Score:1)

by The Cat ( 19816 ) writes:

If asked, the man will say "so?" and then go on to destroy the lives of thousands more.
Of course it's unjust. That's the whole point.
Code typos (Score:3)

by engineer37 ( 6205042 ) writes: on Monday August 26, 2024 @01:16PM (#64736334)

If it can make a typo like that in text, it can make a typo like that in code. Although some code typos won't compile, others will compile and run fine but introduce serious bugs. Lots of security bugs are as simple as off-by-one typos. Just imagine what can happen if the wrong variable name is used somewhere.
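
To make that concrete, here is a minimal Java sketch of both kinds of typo. Everything in it (the class TypoBugs and its methods) is made up for illustration and has nothing to do with Amazon Q's actual output. Each "typo" version differs from the correct one by a single token and still compiles.

```java
// Hypothetical illustration: one-token typos that compile but change behavior.
class TypoBugs {

    // Intended behavior: copy the first n elements of src.
    static int[] copyFirst(int[] src, int n) {
        int[] dst = new int[n];
        for (int i = 0; i < n; i++) {
            dst[i] = src[i];
        }
        return dst;
    }

    // Off-by-one typo: "<=" instead of "<". Compiles cleanly, but the last
    // iteration indexes past the end of dst, so Java throws
    // ArrayIndexOutOfBoundsException at run time.
    static int[] copyFirstTypo(int[] src, int n) {
        int[] dst = new int[n];
        for (int i = 0; i <= n; i++) {
            dst[i] = src[i];
        }
        return dst;
    }

    // Intended behavior: the amount must fit both the balance and the limit.
    static boolean canWithdraw(long balance, long limit, long amount) {
        return amount <= balance && amount <= limit;
    }

    // Wrong-variable typo: "balance" typed where "limit" was meant.
    // Compiles and runs, but the limit is never checked.
    static boolean canWithdrawTypo(long balance, long limit, long amount) {
        return amount <= balance && amount <= balance;
    }

    public static void main(String[] args) {
        System.out.println(canWithdraw(100, 50, 80));
        System.out.println(canWithdrawTypo(100, 50, 80));
    }
}
```

In Java the off-by-one at least fails loudly with an exception. The wrong-variable version fails silently, which is the scarier case.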

Jave Talkin' (Score:2)

by theodp ( 442580 ) writes:

With apologies to the Bee Gees [genius.com]: Jave talkin', you're telling me lies, yeah / Good lovin' still gets in my eyes / Nobody believe what you say / It's just your Jave talkin' that gets in the way
What is the upgrade? (Score:3)

by bradley13 ( 1118935 ) writes: on Monday August 26, 2024 @01:22PM (#64736360) Homepage

It remains unclear: what exactly is the upgrade? You can run Java 8 code just fine under Java 17, so "do nothing" would work. Are they adding generics? Lambdas? Streams? Or none of that?
If it's a trivial search-and-replace of deprecated functions, then no one should be impressed. If it's a major code rewrite, well, no, that's not believable...

- Re: (Score:2)
  
  by Tony Isaac ( 1301187 ) writes:
  
  Maybe this is why so many of the code reviews required no changes! They just had a commit message that said "Upgraded from Java 8 to 17." Excellent, another flawless code review complete!
- Re: (Score:2)
  
  by Chuck Chunder ( 21021 ) writes:
  
  It basically updates your code to be compatible with the most recent major versions of libraries/frameworks and uplifts some usages of deprecated functions. IIRC it is leveraging Openrewrite [openrewrite.org] underneath.
  
  We have used it to upgrade some old code with a bunch of dependencies as a trial and it did a good job, but mostly in the sense of doing something dull and uninventive quickly and well. Doing in an hour or so what would have probably taken a few days of boring iterative work for a developer otherwise.
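
For anyone wondering what "uplifting deprecated function usage" can look like, here is a small hand-written Java sketch. It is an assumption about the kind of change meant, not output from Amazon Q or an OpenRewrite recipe, and the class UpliftSketch and its method names are hypothetical. The two deprecations themselves are real: the Integer(int) constructor and Class.newInstance() have both been deprecated since Java 9.

```java
// Hypothetical before/after of two Java 8 idioms that are deprecated on Java 17.
// The old forms are kept in comments so the file compiles without warnings.
class UpliftSketch {

    // Before: Integer boxed = new Integer(value);
    // After:  use the static factory, which may also reuse cached instances.
    static Integer box(int value) {
        return Integer.valueOf(value);
    }

    // Before: T obj = type.newInstance();
    // After:  go through the no-arg constructor explicitly.
    static <T> T instantiate(Class<T> type) throws ReflectiveOperationException {
        return type.getDeclaredConstructor().newInstance();
    }

    public static void main(String[] args) throws ReflectiveOperationException {
        Integer boxed = box(42);
        StringBuilder sb = instantiate(StringBuilder.class);
        sb.append(boxed);
        System.out.println(sb);
    }
}
```

Each change is mechanical on its own; the tedious part is finding and verifying every call site, which fits the "dull and uninventive" description above.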
Am I missing some context? (Score:2)

by Petersko ( 564140 ) writes:

The AI said "Jave" instead of "Java" so the whole thing should come tumbling down? Fuck off. In the pantheon of demo errors, this doesn't even deserve a mention.
This article isn't just unnecessary... it's stupid. Just like any potential user who walks away from the tech simply because of this one error.
Oh thats cute (Score:2)

by kopecn ( 1962014 ) writes:

The classic argument over symbol naming. And spelling. And the 80-20 rule.
At first I thought the reviews were 'auto' (Score:2)

by OneOfMany07 ( 4921667 ) writes:

The title felt like it was saying something else at first (that someone invited them accidentally). Then I thought the AI was doing code reviews on its own code.
I just wish we could make a tool to summarize better for us. Meaning read, understand, then rewrite (possibly customized for the reader/consumer of said content). Of course there goes a lot of wasted, paid for work. I'm thinking business and government jobs, not just 'journalists' that have become bloggers in reality. And don't get me started o
79% success rate (Score:2)

by byennie ( 1126011 ) writes:

"Amazon developers were unable to find any mistakes in Q's work in '79% of the auto-generated code reviews.'"
Uhhhh, so 21% of the code reviews had mistakes? What kind of mistakes? Is that good?
I always enjoy some good old fashioned "this is a good number" statistics. For all we know those code reviews, if trusted, will lead to worse results than before.
