schlunker
  • Communities
  • Create Post
  • Create Community
  • heart
    Support Lemmy
  • search
    Search
  • Login
  • Sign Up
coyotino [he/him]@beehaw.org to Technology@beehaw.orgEnglish · 9 months ago

Microsoft and Reddit Are Fighting About Why Bing’s Crawler Is Blocked on Reddit

www.404media.co

external-link
message-square
35
fedilink
110
external-link

Microsoft and Reddit Are Fighting About Why Bing’s Crawler Is Blocked on Reddit

www.404media.co

coyotino [he/him]@beehaw.org to Technology@beehaw.orgEnglish · 9 months ago
message-square
35
fedilink
Reddit says that it doesn’t want companies scraping the site for AI. Microsoft says it’s not doing that.
  • coyotino [he/him]@beehaw.orgOP
    link
    fedilink
    English
    arrow-up
    59
    ·
    9 months ago

    The beef between Microsoft and Reddit came to light after I published a story revealing that Reddit is currently blocking every crawler from every search engine except Google, which earlier this year agreed to pay Reddit $60 million a year to scrap the site for its generative AI products.

    I know the author meant “scrape”, but sometimes it really does feel like AI is just scrapping the old internet for parts.

    • cybermass@lemmy.ca
      link
      fedilink
      arrow-up
      15
      ·
      9 months ago

      Yeah, aren’t like over half of reddit comments/posts by bots these days?

      • originalucifer@moist.catsweat.com
        link
        fedilink
        arrow-up
        13
        ·
        9 months ago

        yep, and the longer that happens the less value to the dataset. its becoming aged.

        • KeriKitty (They(/It))@pawb.social
          link
          fedilink
          English
          arrow-up
          13
          ·
          edit-2
          9 months ago

          [Joke] See, Reddit’s doing a nice thing here! They’re making sure nobody ends up toxifying their own dataset by using Reddit’s garbage heap of bot posts!

          • originalucifer@moist.catsweat.com
            link
            fedilink
            arrow-up
            5
            ·
            9 months ago

            google needs a checkbox of ‘ignore reddit’ im sick of having to manually add -reddit

            • The Cuuuuube@beehaw.org
              link
              fedilink
              English
              arrow-up
              13
              ·
              9 months ago

              Hey good news. Turns out you can use bing and not get back Reddit results

              • originalucifer@moist.catsweat.com
                link
                fedilink
                arrow-up
                3
                ·
                9 months ago

                yeah but then i get back bing results. no one needs that

            • i_am_not_a_robot@discuss.tchncs.de
              link
              fedilink
              English
              arrow-up
              3
              ·
              9 months ago

              There’s a browser extension for that. It also works on Pintrest and other useless sites. https://iorate.github.io/ublacklist/docs

Technology@beehaw.org

technology@beehaw.org

Subscribe from Remote Instance

Create a post
You are not logged in. However you can subscribe from another Fediverse account, for example Lemmy or Mastodon. To do this, paste the following into the search field of your instance: !technology@beehaw.org

A nice place to discuss rumors, happenings, innovations, and challenges in the technology sphere. We also welcome discussions on the intersections of technology and society. If it’s technological news or discussion of technology, it probably belongs here.

Remember the overriding ethos on Beehaw: Be(e) Nice. Each user you encounter here is a person, and should be treated with kindness (even if they’re wrong, or use a Linux distro you don’t like). Personal attacks will not be tolerated.

Subcommunities on Beehaw:

  • Free and Open Source Software
  • Programming
  • Operating Systems

This community’s icon was made by Aaron Schneider, under the CC-BY-NC-SA 4.0 license.

Visibility: Public
globe

This community can be federated to other instances and be posted/commented in by their users.

  • 686 users / day
  • 1.7K users / week
  • 3.29K users / month
  • 7.66K users / 6 months
  • 1 local subscriber
  • 38.6K subscribers
  • 4.02K Posts
  • 80.9K Comments
  • Modlog
  • mods:
  • alyaza [they/she]@beehaw.org
  • TheRtRevKaiser@beehaw.org
  • gyrfalcon@beehaw.org
  • rs5th@beehaw.org
  • coldredlight@beehaw.org
  • Leigh@beehaw.org
  • TheRtRevKaiser@kbin.social
  • Chris Remington@beehaw.org
  • BE: 0.19.5
  • Modlog
  • Instances
  • Docs
  • Code
  • join-lemmy.org