How reliable is AI lke ChatGPT in giving you code that you request?

  • amio@kbin.social
    link
    fedilink
    arrow-up
    19
    ·
    1 year ago

    ChatGPT is a language model, it’s not intended for code and you’re using it “off label” at your own risk. It can produce working code, which is impressive in itself, but in order to know if it’s decent code you still need to be competent with that language. I had someone run a few prompts for me a while back, it ignored central parts of the query, and its output was basically like a very junior developer - fair enough, but not great or even that good.

    Potentially useful, but if you expect it to be more than one part of the “process”, you might be setting yourself up for trouble.

    Edit: just like it’s not a coder, it’s not a search engine or knowledge base, either. It just knows language and what seems like it ought to follow a given phrase. Be very aware of this difference, because sometimes it spits out 100% falsehoods with the same level of confidence and authority as the true stuff.

    • Kichae@kbin.social
      link
      fedilink
      arrow-up
      8
      ·
      edit-2
      1 year ago

      I think it’s important for people to also truly understand that generative machine learning models like ChatGPT also only “know” what they’ve seen before. There’s no interpretation or synthesis. It merely regurgitates what it’s seen, with some sampling from a probability distribution.

      This means, if you’re asking for something niche, and it’s only seen what you’re prompting it for once (or, really, the same text repeatedly across multiple websites) , there’s a very good chance that it will just recreate that artifact wholesale.

      Which means you need to be cognizant of what the license for that material is before you use it in a product!

  • AlteredStateBlob@kbin.social
    link
    fedilink
    arrow-up
    13
    ·
    1 year ago

    I have built several programs with ChatGPT 4 by now. From very basic Python scripts to Python webscrapers and C# in combination with Unity3D.

    In the beginning it was much better than it is currently. At the moment context is severely hampered no matter the limit and you’ll be bashing your head against circular arguments and it straight up ignoring stuff you just posted two messages ago.

    Trying to troubleshoot code it wrote a few days ago will be a slog and like dragging yourself over nails at times. Here’s what I have found to help and make life better:

    1. Be very, very, very precise in your instructions. And keep them saved, so you can reuse them later (point 4)
    2. From the very start plan to build your project with small functions that interact (good policy anyway) which makes troubleshooting and changing these functions much easier and will prevent you running into message limits.
    3. If it fails to work for you the way you need it, you’ll might have to scrap your entire code and start over with ChatGPT -> Again, the reason for point 2 being very important. Scrapping one functin is much less painful than an entire tool.
    4. Start new chats when you feel the quality degrading. Sometimes it helps and since the context is garbage at the moment anyway, it doesn’t matter much.
    5. Post the code it is supposed to fix every single time. It will inevitably refer to other code, code hallucinations, etc. otherwise. Again why point 2 is important.
  • Utilael@kbin.social
    link
    fedilink
    arrow-up
    8
    ·
    edit-2
    1 year ago

    I agree with the other comments that ChatGPT isn’t really that good for programming, it hallucinates often and you end up working too hard just to try and figure out what it got wrong. However, I have found a good AI engine, phind.com, that has started to replace my google searches. It’s just a wrapper for ChatGPT, but it cites its sources so you can verify or dig deeper, provides search engine results in a sidebar and has upvote/downvote options to help it improve. So it feels like a personal google “agent” that runs off and googles something for you and comes back with a concise report.

    • Deely@kbin.social
      link
      fedilink
      arrow-up
      3
      ·
      edit-2
      1 year ago

      Personally I just can’t work with system that lies to me (even for a little) but all the time.
      I tried to use chatGpt and Bing bot and phind.com few times and everytime I got answers that looks like real and looks like correct answer but slightly (and few times completely) wrong.
      Everytime I have to reread documentation, check links, investigate is there a reason why LLM answered this way, maybe I wrong this time and LLM found something that I did not found…

      I agree that phind.com get best results, but every small incorrectness here and there irks me and makes me question myself and answer as whole.

      Upd: in general questions, like when you trying to investigate some new field, technology, tooling suite LLM is very, very good. When you want to get something like overview of topic that you interested in.

  • Flaky@kbin.social
    link
    fedilink
    arrow-up
    5
    ·
    edit-2
    1 year ago

    I’ve used ChatGPT to answer questions relating to Python. Notably, I asked it how to use QtNetwork to send and receive requests with authentication, as the application I was using did not use any non-standard modules I was more accustomed to like requests but did have PyQt. Not only did it gave me working code snippets but explained it in a way that I was able to understand. No, it’s not perfect. But man it’s better than hunting Google for that one StackOverflow post.

    I have heard it trips up on certain less-used programming languages like Swift though, so depending on your use case YMMV. I’ve also not used Codex but a friend of mine has. Apparently it really liked to mention this one specific GitHub profile.

    For shits and giggles I asked ChatGPT a while back to represent a Pokemon with a Python class, and it gave me working code. Google Bard would trip up and not use the class when I told it to.

    • ___hulk@kbin.social
      link
      fedilink
      arrow-up
      3
      ·
      1 year ago

      Like any other tool, it’s as good as you use it. If you can explain complex problems in smaller bites with clear objectives it helps a lot.

      • bored_pistachio@kbin.social
        link
        fedilink
        arrow-up
        3
        ·
        1 year ago

        Yes, but no.
        You can be as precise as you want, if ChatGPT didn’t have enough training material you won’t get good answer even if you bend over backwards.
        I can’t complain 95% of time, however prompt is not to blame all the time.

  • soundasleep@kbin.social
    link
    fedilink
    arrow-up
    3
    ·
    1 year ago

    Software engineer with decades of experience here - ChatGPT can give you mostly-working code for solved problems, but with occasionally subtle and weird bugs. It’s very confident and will happily hallucinate. It will not help you with debugging or integrating, which is the majority of coding. It’s a pattern matching engine, nothing more.

  • LoafyLemon@kbin.social
    link
    fedilink
    arrow-up
    3
    ·
    edit-2
    1 year ago

    Not at all. It often gives bad answers, or workarounds rather than working code. It’s not useful to me if I have to fix its code, when I can do it more efficiently and quicker by my own.

    With some hand-holding, it’s quite decent at reading and documenting functions, which is what I use it for since I’m too lazy to document them myself. :P

  • i_need_a_vacation@kbin.social
    link
    fedilink
    arrow-up
    2
    ·
    1 year ago

    For basic things like syntax that I can’t remember it’s actually pretty good, way more faster than google IMO.

    When I ask for something a little more complicated it can go two ways:

    • Actually doing a good job and generating something that I can use (I often have to polish that code, but still, it’s better than expected)

    • Doing whatever that I didn’t ask for, so when I point out that’s not what I meant and explain it further, it enters a bucle where it goes back to the two same solutions.

    “You didn’t like solution A? Here, there’s solution B. Solution B is terrible? Here, there’s A again.”

    Overall it’s a great tool though.

  • PolarBearSequence@kbin.social
    link
    fedilink
    arrow-up
    2
    ·
    1 year ago

    I quite like GitHub Copilot and use it a lot, but I find ChatGPT not all that useful.
    For actual coding, it feels like describing what I want it to do is more complicated than doing it myself.
    I can see some uses as a search engine, but I’ve had a lot of bad luck where it suggested code that was plain wrong or not working and often did not even compile, so most of the time, I’d rather look on GitHub, Stackoverflow or sites like that.

  • Otome-chan@kbin.social
    link
    fedilink
    arrow-up
    2
    ·
    1 year ago

    I use chatgpt a lot when coding. It’s pretty good and the code is typically usable. But sometimes it messes up hard and it can take a while to realize that. Net benefit though and I’m sure the technology will improve over time.

  • Cyzaine@kbin.social
    link
    fedilink
    arrow-up
    2
    ·
    1 year ago

    Used chat gpt 3.x to assist in Godot and gdscript logic for our game… Nearly always wrong but often gets me thinking in a different direction than the one I was stuck in. So certainly value there.

  • asteroidrainfall@kbin.social
    link
    fedilink
    arrow-up
    2
    ·
    edit-2
    1 year ago

    Outside producing one simple WebPack configuration, I haven’t had good experiences using ChatGPT. It often causes me more trouble than it helps. I’ve tried to use it multiple times to write some BASH script, and every time it gives me know that looks nice but is just broken. It’s not syntactically incorrect, it’s more like functionally incorrect.

    For example, it told me that you could pass arrays as function arguments, which you can’t do. Or, it gave me a script that was using variables within a URL string that would be passed into CURL, which won’t work since the URL won’t be encoded properly.

    When I do it, I spend more time trying to fix the code that it gives me. Which, I guess, does have the benefit that it means I got to learn something afterward (both examples above I didn’t know about until ChatGPT gave me the bad code).

    The thing that ensured me that AI won’t take over the programming side of software engineering was when I asked ChatGPT to help me out with some date-time bugs. It just kept making up native JavaScript API functions, couldn’t understand how to parse UTC to figure out a date-time’s timezone, among other issues. The day that AI is able to solve software issues around date-times or currencies is the day that we’ll all be out of a job.

    Edit:
    I guess you could summarize using ChatGPT is like peer-programming with an overly confident CS grad.

    • experbia@kbin.social
      link
      fedilink
      arrow-up
      2
      ·
      1 year ago

      peer-programming with an overly confident CS grad
      I love this, and agree. I’ve always said that for all tasks, it’s like you’re working with an ADHD eager-to-please intern.

  • mfz@kbin.social
    link
    fedilink
    arrow-up
    2
    ·
    1 year ago

    It is a conversational tool that can generate decent code if properly prompted but it lacks for the most part enough context. For it to be really useful it has to be able to be trained on my entire project that I’m working with, not just a single file or function.

    What I miss is the ability to “chat with my project”. I.e. have the whole project in the trained context, and then reason about architectural changes, pros and cons, have suggestions for refactoring, help with complex renaming schemes and moving code, etc.

    It would be super interesting to be able to give instructions like:

    • Organise my files by dependency and the logic they implement.
    • Or something like, create web components from common input elements in my html pages.
    • Where is the user auth code implemented?

    Things like that.

    • experbia@kbin.social
      link
      fedilink
      arrow-up
      2
      ·
      1 year ago

      This would really be extremely helpful, absolutely agree. A mode with less of a view on the details of the code and more with the architecture of it. I wonder if an extension like Codeium could be extended so that only the method signatures and comments and such could be sent in as context so it can reason more generally about your project…

  • ZILtoid1991@kbin.social
    link
    fedilink
    arrow-up
    2
    ·
    1 year ago

    I haven’t used it, however from what I’ve heard it’s more of a less toxic Stackoverflow that can hallucinate things, rather than some magic that writes you code based on some words. It still suffers from the usual AI weakness of the AI itself not understanding context, only knowing it.

  • experbia@kbin.social
    link
    fedilink
    arrow-up
    2
    ·
    edit-2
    1 year ago

    I find ChatGPT to be less useful for code and more useful for generating boilerplate more in the ‘configuration’ realm. Ansible playbooks or tasks, Nginx configs, Dockerfiles, docker-compose files, etc. Well-bounded things with an abundance of clear documentation.

    I generate a lot of first-draft Dockerfiles and docker-compose files through ChatGPT now with a short description of what I want. It’s always worth reviewing it because sometimes it just invents things that look like a Dockerfile, but it can save a lot of the boring boilerplate writing of volumes and networks and depends_ons and obvious env vars you need to override.

    I do use Codeium in my VS Code instance, though. It’s like a free more ethical Github Copilot, and I’ve been really really happy with it. Not so much to make a whole program, but I use it a lot more as a kind of super-autocomplete.

    I’ll go in to a class and go to a method that needs a change and I’ll just type a comment like the following and it will basically spit out the authentication logicc that I do a quick review on.

    // check the request authentication header against the user service to verify we're allowed to do this
    
    

    It’s also an amazing “static” debugger - I can highlight particularly convoluted segments of math or recursion or iteration and ask it to explain it. Then I can ask follow-up questions like “Is there any scenario in which totalFound remains at 0” and it will tell me yes or no and why it thinks that, which is really nice. I tend to save it for instances where I’m reasonably certain that it was all correct, but I wanted to check it instead. Now instead of breaking out the paper and pen and reasoning it out, I can ask it for a second opinion, and if it has no doubts, my paranoid mind is put at ease a bit.

    I’ve been unimpressed with the ability of any of these “AI” systems to spit out larger volumes of good code. They’re more like ADHD, eager-to-please little interns. They’ll spit out the first answer that comes to their mind even if it’s wrong, and they fall for all kinds of well-known development pitfalls.