How Do You Stop Fucking Baido Indexing Sites?..

Collapse
X
 
  • Time
  • Show
Clear All
new posts
  • EddyTheDog
    Just Doing My Own Thing
    • Jan 2011
    • 25433

    #1

    How Do You Stop Fucking Baido Indexing Sites?..

    It ignores robots.txt - Anyone know what IPs it uses?....
  • Magnetron
    Lord High Groundhog
    • Jun 2013
    • 1841

    #2
    What do you have against Scott Baio?
    .
    Approach the mirrored reflection saying his name three times : "Butcher .... Butcher .... Butcher ....."
    and wait to see if this Bogeyman urban legend manifests in the background, looming over shoulder
    While your neighbors were busy killing off everyone in the neighborhood
    with your own butcher knife in hand concealed behind your back
    you stood for
    ever before the window saying
    nothing

    Comment

    • WDF
      Confirmed User
      • Jan 2013
      • 2248

      #3
      Don't you like all that CN traffic?
      Please HELP

      Comment

      • freecartoonporn
        Confirmed User
        • Jan 2012
        • 7683

        #4
        here it is .

        Stop Baidu crawler
        SSD Cloud Server, VPS Server, Simple Cloud Hosting | DigitalOcean

        Comment

        • brassmonkey
          Pay It Forward
          • Sep 2005
          • 77396

          #5
          fucking racist!!







































          TRUMP 2026 KEKAW!!! - The Laken Riley Act Is Law!
          DACA ENDED - SUPPORT AZ HCR 2060 52R - email: brassballz-at-techie.com

          Comment

          • bean-aid
            So Fucking Banned
            • Jun 2011
            • 16493

            #6
            Just have host blacklist china traffic.

            Comment

            • ErectMedia
              Confirmed Chicago Pimp
              • Aug 2004
              • 7100

              #7
              robots.txt is like asking your neighbor to keep his dog off your lawn, htaccess is like installing an electric fence

              Comment

              • rowan
                Too lazy to set a custom title
                • Mar 2002
                • 17393

                #8
                Originally posted by beaner
                Just have host blacklist china traffic.
                Make sure they have clue before they do that, and don't end up blacklisting all of Asia Pacific.

                My own solution to the problem is to firewall any IP that presents a Baidu user-agent.

                Comment

                • fuzebox
                  making it rain
                  • Oct 2003
                  • 22351

                  #9
                  I would never turn down free traffic. If you don't want china leeching your resources, redirect it somewhere useful

                  Comment

                  • rowan
                    Too lazy to set a custom title
                    • Mar 2002
                    • 17393

                    #10
                    Originally posted by fuzebox
                    I would never turn down free traffic. If you don't want china leeching your resources, redirect it somewhere useful
                    Dunno about the OP but I'm assuming that he's in the same boat as I am - the issue is that the Baidu web spider trawls over the whole site, but never actually sends any (or very little) human traffic.

                    If your site has a decent number of pages and/or it is dynamically generated then Baidu really is just wasting resources.

                    Comment

                    • Phoenix
                      BACON BACON BACON
                      • Nov 2002
                      • 35475

                      #11
                      i might be interested in taking all chinese traffic.
                      Telegram PhoenixBrad
                      https://quantads.io

                      Comment

                      • FlowerKid
                        Confirmed User
                        • Sep 2005
                        • 1045

                        #12
                        If baidu spider teaffic is relevant for your server performance, maybe it's time to upgrade.

                        Comment

                        • hdbuilder
                          Confirmed User
                          • Jun 2012
                          • 1338

                          #13
                          Just put this in your htaccess file in the root of each domain:

                          # Block bad spiders
                          RewriteEngine On
                          RewriteCond %{HTTP_USER_AGENT} Sosospider [NC,OR]
                          RewriteCond %{HTTP_USER_AGENT} Baiduspider [NC,OR]
                          RewriteCond %{HTTP_USER_AGENT} Sogou
                          RewriteRule ^.* - [F,L]

                          You can add as many as you want , make sure the lines ends with [NC,OR] and with nothing for the last one

                          Using it for years and been tested ...

                          ROBO SCRIPTS | WP CAM PLUGIN - Scripts To Promote Cam Sites - Chaturbate, BongaCams, Streamate, LiveJasmin, Stripchat...

                          The Cam Site Builder, The Cam Multi Site Builder -> MULTIPLE CAM SITES IN ONE

                          Comment

                          • EddyTheDog
                            Just Doing My Own Thing
                            • Jan 2011
                            • 25433

                            #14
                            Originally posted by hdbuilder
                            Just put this in your htaccess file in the root of each domain:

                            # Block bad spiders
                            RewriteEngine On
                            RewriteCond %{HTTP_USER_AGENT} Sosospider [NC,OR]
                            RewriteCond %{HTTP_USER_AGENT} Baiduspider [NC,OR]
                            RewriteCond %{HTTP_USER_AGENT} Sogou
                            RewriteRule ^.* - [F,L]

                            You can add as many as you want , make sure the lines ends with [NC,OR] and with nothing for the last one

                            Using it for years and been tested ...
                            Thanks - That looks like it will do the trick...

                            The main reason is that it inflates my traffic to sponsors so much its hard to see what the real conversions are - If you are as into stats as me it is a real pain in the ass...

                            I am moving towards using GeoIp scripts and sorting traffic that way but it takes time.....

                            Comment

                            Working...