• j4k3@lemmy.world
    link
    fedilink
    arrow-up
    5
    ·
    1 year ago

    I’m willing to bet the lack of api access going forward will make all reddit posts disappear from crawler results anyways. I’m no expert, but I imagine the crawler is picking up on all of the interconnected references to reddit that are all due to free api access. As soon as those connections disappear, so dies the value to the entire community. It will be just like the garbage results we get from every single source now. This is the path of neo digital feudalism.

    • jamesravey@lemmy.nopro.be
      link
      fedilink
      arrow-up
      6
      ·
      1 year ago

      API calls are almost always private between the caller and the endpoint (think telegram bots or mobile apps). There isn’t really a technically feasible way for a crawler to somehow “infer” any kind of knowledge of how api calls are being used unless the result has some kind of publically visible side effect (E. G. The program using the api is generating a web page and uploading it somewhere crawlable). Google et Al go by how many links from other pages to the page of interest exist (inbound links) and multiply by a smattering of other things like quality of keywords, length of content etc.

      That said, if you’re implying that the api changes mean that:

      • people are less likely to use reddit because they can’t access it via RIF/Apollo
      • less useful content is added to the site to be indexed,
      • fewer inbound links will be generated that point to existing posts
      • pages stagnate and drop in ranking

      That is a plausible concern.

      • j4k3@lemmy.world
        link
        fedilink
        arrow-up
        5
        ·
        edit-2
        1 year ago

        fewer inbound links will be generated that point to existing posts

        pages stagnate and drop in ranking

        This is what I mean, the external references people had in the periphery will dry up. Like if I’m not using Infinity to generate better refined search results, now I don’t post the link to Stack Exchange, and this reference fails to cascade across various copy paste blog resources. Now the original reddit post is a dead end source with no external weighted reference value. It’s all of these advanced features implemented in the periphery using the free API that create the usefulness in the first place.

        Searching reddit will be just like YouTube searches now. No matter what technical wording you use, you’ll never find technical references again. I can type the title of a video on YT verbatim and still won’t get the correct results, but I can log into an old account and find the content in my hundreds of playlists I kept as references. It is still there, it is still public.