• Star@sopuli.xyzOP
    link
    fedilink
    English
    arrow-up
    388
    arrow-down
    14
    ·
    edit-2
    10 months ago

    It’s so ridiculous: when corporations steal everyone’s work for their own profit, no one bats an eye, but when a group of individuals does the same to make education and knowledge free for everyone, it’s somehow illegal, unethical, immoral and whatnot.

    • richieadler@lemmy.myserv.one
      link
      fedilink
      English
      arrow-up
      21
      ·
      edit-2
      10 months ago

      Cue the Max Headroom episode where the blanks (disconnected people) are chased by the censors because the blanks steal cable so their children can watch the educational shows and learn to read, and they are forced to use clandestine printing presses to teach them.

    • Grimy@lemmy.world
      link
      fedilink
      English
      arrow-up
      1
      arrow-down
      1
      ·
      10 months ago

      Using publicly available data to train isn’t stealing.

      Daily reminder that the ones pushing this narrative are literally corporations like OpenAI. If you can’t use copyrighted materials freely to train on, it drives up the cost so much that only a handful of companies can afford the data.

      They want to kill the open-source scene and are manipulating you to do so. Don’t build their moat for them.

      • givesomefucks@lemmy.world
        link
        fedilink
        English
        arrow-up
        1
        ·
        edit-2
        10 months ago

        And using publicly available data to train gets you a shitty chatbot…

        Hell, even using copyrighted data to train isn’t that great.

        Like, what do you even think they’re doing here for your conspiracy?

        You think OpenAI is saying they should pay for the data? They’re trying to use it for free.

        Was this a meta joke and you had a chatbot write your comment?

        • tourist@lemmy.world
          link
          fedilink
          English
          arrow-up
          1
          ·
          10 months ago

          Was this a meta joke and you had a chatbot write your comment?

          if someone said this to me I’d cry

        • webghost0101@sopuli.xyz
          link
          fedilink
          English
          arrow-up
          0
          ·
          edit-2
          10 months ago

          The point being made was that publicly available data includes a whole lot of copyrighted data to begin with, and it’s pretty much impossible to filter it out. Prime example: the Eiffel Tower in Paris is not copyright protected, but the lights on it are, so you can only use pictures of the Eiffel Tower taken during the day, and only if the picture itself isn’t copyright protected by the original photographer. Copyright law has all these complex caveats and exceptions that make it impossible to tell at a glance whether or not something is protected.

          This in turn means that if AI cannot legally train on copyrighted materials it finds online without paying huge sums of money, then effectively only mega corporations that can pay copyright fines as a cost of doing business will be able to afford training decent AI.

          The only other option for producing any AI of this type is a very narrow, curated set of known materials with a public-use license, but that is not going to get you anything competent on its own.

          EDIT: In case it isn’t clear, I am clarifying what I understood from Grimy@lemmy.world’s comment, not adding to it.

          • givesomefucks@lemmy.world
            link
            fedilink
            English
            arrow-up
            0
            ·
            10 months ago

            That’s insane logic…

            Like you’re essentially saying I can copy/paste any article without a paywall to my own blog and sell adspace on it…

            And you’re still saying OpenAI is trying to make AI companies pay?

            Like, do you think AI runs off free cloud services? The hardware is insanely expensive.

            And OpenAI is trying to argue the opposite, that AI companies shouldn’t have to pay to use copyrighted works.

            You have zero idea what is going on, but you are really confident you do

            • webghost0101@sopuli.xyz
              link
              fedilink
              English
              arrow-up
              0
              ·
              10 months ago

              I clarified the comment above, which was misunderstood. Whether it makes a moral/sane argument is subjective, and I am not covering that.

              I am not sure why you think there is a claim that OpenAI is trying to make companies pay. On the contrary, the comment I was clarifying (so not my opinion/words) states that OpenAI is arguing that anyone should be able to use copyrighted materials for free to train AI.

              The cost of running an online service like ChatGPT is wildly beside the point of the argument presented. You can run your own open-source large language models at home about as well as you can run Bethesda’s Starfield on a similarly spec’d PC.

              Those open-source large language models are trained on the same collections of data, including copyrighted data.

              The logic being used here is:

              If it becomes globally forbidden to train AI on copyrighted materials, or there is a large price or fine for using them in training, then the non-corporate, free, open-source side of AI will perish or have to go underground, while the for-profit mega corporations will continue to exploit and train AI as usual because they can pay to settle in court.

              The ethical dilemma as I understand it is:

              Allowing AI to train for free is a direct threat to creatives and a win for BigProfit Entertainment; not allowing it to train for free is a threat to public, democratic AI and a win for BigTech merging with BigCrime.

              • Grimy@lemmy.world
                link
                fedilink
                English
                arrow-up
                0
                arrow-down
                1
                ·
                10 months ago

                That is very well put, I really wish I could have started with that.

                Though I envision it as a loss for BigProfit Entertainment, since I see this as a real boon for the indie gaming, animation and eventually filmmaking industries.

                It’s definitely overall quite a messy situation.

          • RainfallSonata@lemmy.world
            link
            fedilink
            English
            arrow-up
            0
            arrow-down
            1
            ·
            10 months ago

            I didn’t want any of this shit. IDGAF if we don’t have AI. I’m still not sure the internet actually improved anything, let alone what the benefits of AI are supposed to be.

            • RememberTheApollo@lemmy.world
              link
              fedilink
              English
              arrow-up
              0
              ·
              10 months ago

              It doesn’t matter what you want. What matters is if corporations can extract $ from you, gain an efficiency, or cut their workforce using it.

              That’s what the drive for AI is all about.

            • Grimy@lemmy.world
              link
              fedilink
              English
              arrow-up
              0
              arrow-down
              1
              ·
              10 months ago

              You don’t have to use it. You can even disconnect from the internet completely.

              What’s the benefit of stopping me from using it?

        • Grimy@lemmy.world
          link
          fedilink
          English
          arrow-up
          0
          arrow-down
          1
          ·
          10 months ago

          If the data has to be paid for, OpenAI will gladly do it with a smile on their face. It guarantees them a monopoly and ownership of the economy.

          Paying more but having no competition except Google is a good deal for them.

          • givesomefucks@lemmy.world
            link
            fedilink
            English
            arrow-up
            0
            ·
            edit-2
            10 months ago

            Eh, the issue is lots of people wouldn’t be willing to sell tho.

            Like, you think an author wants the chatbot to read their collected works and use that? Regardless of whether it’s quoting full texts or “creating” text in their style.

            No author is going to want that.

            And if it’s up to publishers, they likely won’t either. Why take one small payday if that could potentially lead to a loss of sales a few years down the road?

            It’s not like the people making the chatbots just need to buy a retail copy of the text to be in the legal clear.

            • Grimy@lemmy.world
              link
              fedilink
              English
              arrow-up
              0
              arrow-down
              1
              ·
              10 months ago

              The publishers will absolutely sell, imo. They just publish; the book will be worth the same with or without the help of AI to write it.

              I guess there is a possibility that people start replacing bought books with personalized, book-length LLM outputs, but that strikes me as unlikely.

      • grue@lemmy.world
        link
        fedilink
        English
        arrow-up
        0
        ·
        edit-2
        10 months ago

        They want to kill the open-source scene

        Yeah, by using the argument you just gave as an excuse to “launder” copyleft works in the training data into permissively-licensed output.

        Including even a single copyleft work in the training data ought to force every output of the system to be copyleft. Or if it doesn’t, then the alternative is that the output shouldn’t be legal to use at all.

        • Grimy@lemmy.world
          link
          fedilink
          English
          arrow-up
          0
          arrow-down
          1
          ·
          10 months ago

          100% agree, making all outputs copyleft is a great solution. We get to keep the economic and cultural boom that AI brings while keeping the big companies in check.

      • kibiz0r@lemmy.world
        link
        fedilink
        English
        arrow-up
        0
        ·
        10 months ago

        We have a mechanism for people to make their work publicly visible while reserving certain rights for themselves.

        Are you saying that creators cannot (or ought not be able to) reserve the right to ML training for themselves? What if they want to selectively permit that right to FOSS or non-profits?

        • Grimy@lemmy.world
          link
          fedilink
          English
          arrow-up
          0
          arrow-down
          1
          ·
          10 months ago

          Essentially yes. There isn’t a happy solution where FOSS gets the best images and remains competitive. The amount of data needed is outside what can be donated. Any open source work will be so low in quality as to be unusable.

          It also won’t be up to them. The platforms where the images are posted will be selling and brokering. No individual is getting a call unless they are a household name.

          None of the artists are getting paid either way so yeah, I’m thinking of society in general first.

      • TwilightVulpine@lemmy.world
        link
        fedilink
        English
        arrow-up
        0
        ·
        10 months ago

        OpenAI is definitely not the one arguing that they have stolen data to train their AIs, and Disney will be fine whether AI requires owning the rights to training materials or not. Small artists, the ones protesting the most against it, will not. They are already seeing jobs and commission opportunities declining due to it.

        Being publicly available in some form is not permission to use and reproduce those works however you feel like. Only the real owner has the right to decide. We on the internet have always been a bit blasé about it, sometimes deservedly, but as we get to a point where we are driving away the very same artists that we enjoy and get inspired by, maybe we should be a bit more understanding of their position.

        • Grimy@lemmy.world
          link
          fedilink
          English
          arrow-up
          0
          arrow-down
          1
          ·
          edit-2
          10 months ago

          That’s basically my main point: Disney doesn’t need the data, and neither does Getty. AI isn’t going away and the jobs will be lost no matter what.

          Putting a price tag in the high millions for any kind of generative model only benefits the big players.

          I feel for the artists. It was already a very competitive domain that didn’t really pay well, and it’s now much worse, but if they aren’t a household name, they aren’t getting a dime out of any new laws.

          I’m not ready to give the economy to Microsoft, Google, Getty and Adobe so GRRM can get a fat payday.

          • TwilightVulpine@lemmy.world
            link
            fedilink
            English
            arrow-up
            0
            ·
            10 months ago

            If AI companies lose, small artists may have the recourse of seeking compensation for the use and imitation of their art too. Just feeling for them is not enough if they are going to be left to the wolves.

            There isn’t a scenario here in which big media companies lose so talking of it like it’s taking a stand against them doesn’t make much sense. What are we fighting for here? That we get to generate pictures of Goofy? The small AI user’s win here seems like such a silly novelty that I can’t see how it justifies just taking for granted that artists will have it much rougher than they already have.

            The reality here is that even if AI gets the free pass, large media and tech companies are still primed to profit from it far more than any small user. They will be the ones making AI-assisted movies and integrating chat AI into their systems. They don’t lose in either situation.

            There are ways to train AI without relying on unauthorized copyrighted data. Even if OpenAI loses, it wouldn’t be the death of the technology. It may be more efficient and effective to train them with that data, but why is “efficiency” enough to justify this overreach?

            And is it even wise to be so callous about it? Because it’s not going to stop with artists. This technology has the potential to replace large swaths of service industries. If we don’t think of the human costs now, it will be even harder to make a case for everyone else.

            • Grimy@lemmy.world
              link
              fedilink
              English
              arrow-up
              0
              arrow-down
              1
              ·
              10 months ago

              I fully believe AI will be able to replace 50% or more of desk jobs in the near future. It’s definitely a complicated situation and you make good points.

              First and foremost, I think it’s imperative the barrier for entry for model training is as low as possible. Anything else basically gives a select few companies the ability to charge a huge subscription fee on all our goods and services.

              The data needed is pretty heavy as well; it’s not very feasible to go off of donated or public domain data.

              I also think any job loss is virtually guaranteed and trying to save them is misguided as well as not really benefiting most of those affected.

              And yea, the big companies win either way but if it’s easier to use this new tech, we might not lose as hard. Disney for instance doesn’t have any competition but if a bunch of indie animation companies and groups start popping up, it levels the playing field a bit.

      • deweydecibel@lemmy.world
        link
        fedilink
        English
        arrow-up
        0
        ·
        edit-2
        10 months ago

        The point is that the entire concept of AI training off people’s work to make profit for others is wrong without the permission of, and compensation for, the creator, regardless of whether it’s corporate or open source.

        • ANGRY_MAPLE@sh.itjust.works
          link
          fedilink
          English
          arrow-up
          0
          ·
          10 months ago

          I think I’ve decided to not publish anything that I want to keep ownership of, just in case. There’s an entire planet’s worth of countries, which will all have their own sets of laws. It takes waay too long to polish something, only to just give it away for free haha. Someone else is free to do that work if it is that easy. No skin off my back.

          I think it’s similar to many other hand-made crafts/items. Most people will buy their clothes from stores, but there are definitely still people who make beautiful clothing by hand, better than machines could.

          Don’t even get me started on stuff like knitting. It already costs the creator a crap ton of money just for the materials. It takes a crap ton of time to make those, too. Despite the costs, many people just expect those knitted pieces for practically free. The people who expect that pricing are also free to go with machine-produced crafts/items instead.

          It comes down to what people want, and what they’re willing to pay, imo. Some people will find value in something physically being put together by another human, and other people will find value in having more for less. Neither is “wrong” necessarily, so long as no one is literally ripped off. (With over 8 billion people, it’s bound to happen at least once. I feel bad for whoever that is.)

          That being said, we’ll never be able to honestly say that the specific skills and techniques currently required for each are exactly the same. It would be like calling a photographer amazing at realist painting because their photo looks like real life. Photographers and painters both have their place, but they are not the same.

          I think that’s also part of what’s frustrating so many artists. Coding AI is not the same as using the colour wheel, choosing materials, working fine motor control, etc. It’s not learning about shadows, contrast, focal points, etc. I can definitely understand people not wanting those aspects to be brushed off, especially since it usually takes most of a lifetime to achieve. A music generator and a violin may both make great music, but they are not the same, and they require different technical skills.

          I’ll never buy AI art if I have any say in the matter. I’ll support handmade stuff first, every time.

          • Grimy@lemmy.world
            link
            fedilink
            English
            arrow-up
            0
            arrow-down
            1
            ·
            edit-2
            10 months ago

            There is definitely more value in hand-made art. Even the fanciest prints on canvas can’t compare, and I don’t think AI art will be evoking the same feelings a John Waterhouse exhibit does any time soon.

            On the subject of publishing, I’ve chosen to embrace it personally. My view is that even the hidden stuff on our computers ends up in a Chinese or US database anyway.

      • givesomefucks@lemmy.world
        link
        fedilink
        English
        arrow-up
        16
        arrow-down
        9
        ·
        10 months ago

        Because it’s easy to get these chatbots to output direct copyrighted text…

        Even ones the company never paid for, not even just a subscription for a single human to view the articles they’re reproducing. Like, think of it as buying a movie, then burning a copy for anyone who asks.

        And reproducing it word for word for people who didn’t pay is a whole other issue. So this is more like torrenting a movie, then seeding it.

        • burliman@lemmy.world
          link
          fedilink
          English
          arrow-up
          9
          arrow-down
          13
          ·
          10 months ago

          It’s not that easy; don’t believe the articles being broadcast every day. They are heavily cherry-picked.

          Also, if someone is creating copyrighted works, it is on that person to be responsible if they release or sell them, not the tool they used. Just because the tool can be good (learns well and responds well when asked to make a clone of something) doesn’t mean that is the only thing it does or must do. It is following instructions, which were to make a thing. The one giving the instructions is the issue, and the intent of that person when they distribute is the issue.

          If I draw a perfect clone of Donald Duck in the privacy of my home after looking at hundreds of Donald Duck images online, there is nothing wrong with that. If I go on Etsy and start selling them without a license, they will come after ME. Not because I drew it, but because I am selling it and violating a copyright. They won’t go after the pencil or ink manufacturer. And they won’t go after Adobe if I drew it on a computer with Photoshop.

          • givesomefucks@lemmy.world
            link
            fedilink
            English
            arrow-up
            15
            arrow-down
            5
            ·
            edit-2
            10 months ago

            If I draw a perfect clone of Donald Duck in the privacy of my home after looking at hundreds of Donald Duck images online, there is nothing wrong with that

            In your picture example it would be an exact copy…

            But even if you started a business and, when people asked for a picture of Donald Duck, gave them a traced copy, that’s still copyright infringement… Hell, even your bad analogy of a person’s own drawing is still copyright infringement.

            The worst thing about these chatbots is the people who think it’s amazing don’t understand what it’s doing. If you understood it, it wouldn’t be impressive.

            • Grimy@lemmy.world
              link
              fedilink
              English
              arrow-up
              1
              arrow-down
              3
              ·
              10 months ago

              You are missing his point. Is Disney going after the one who is selling the copy online, or are they going after Adobe?

              • givesomefucks@lemmy.world
                link
                fedilink
                English
                arrow-up
                2
                arrow-down
                2
                ·
                edit-2
                10 months ago

                In that analogy, OpenAI is the one selling it, because they’re the ones using it to prop up their product.

                I didn’t think I needed to explicitly state that, but well, here we are.

                Have a nice life tho. I’m over accounts that stop replying to one thread of replies and then just go and reply to one of my other comments asking me to explain what I’ve already told them.

                Waaaay easier to just never see replies from that account

                • Grimy@lemmy.world
                  link
                  fedilink
                  English
                  arrow-up
                  1
                  arrow-down
                  2
                  ·
                  10 months ago

                  Some of us have to work for a living. I can’t reply to every comment the moment it comes in, and it seems rude to break the chain.

                  In his analogy, OpenAI’s product was the tool. You can do the same with both image generation and Photoshop, and neither of these props up its product by implying it’s easy to infringe copyright. That’s why I said you were missing his point, but you do you, buddy.

                  • webghost0101@sopuli.xyz
                    link
                    fedilink
                    English
                    arrow-up
                    2
                    arrow-down
                    1
                    ·
                    edit-2
                    10 months ago

                    That guy is a total loss, a tragic clown; I am not sure if I should cry or laugh about them. I am not sure what business they had in this argument at all, except pissing off people who are better informed.

                    I only just saw the comment where he seemed to suggest that the final trained model contains all the training materials in full… and that, combined with pretending not once but multiple times that they know all about LLMs, that FOSS AI doesn’t exist, and that it’s we who don’t know how any of this tech works… I seriously have to restrain myself and leave that gross misconception as it is, as I don’t want them to respond. I hope the downvotes do their job.

                    I am sorry to vent, kinda just had to :)

      • TwilightVulpine@lemmy.world
        link
        fedilink
        English
        arrow-up
        6
        arrow-down
        3
        ·
        10 months ago

        Because humans have more rights than tools. You are free to look at copyrighted text and pictures, memorize them and describe them to others. It doesn’t mean you can use a camera to take and share pictures of it.

        Acting like every right that AIs have must be identical to humans’, and if not that means the erosion of human rights, is a fundamentally flawed argument.