Warning with this technique (Google)

Collapse
X
 
  • Time
  • Show
Clear All
new posts
  • roganoli
    Confirmed User
    • Dec 2004
    • 757

    #1

    Warning with this technique (Google)

    I'm still an inexperienced webmaster...

    I was having good placements on Google for my new site... He was even making good sales!

    Suddenly something happened. My website disappeared from search! Today I found out what caused it... My site was totally copied by the proxy service Google! See the link below:

    https://www.google.com/#q=%22appspot...w=1280&bih=663

    I believe this was done intentionally by another webmaster and I believe I have solved this problem using my htaccess!

    But be warned for other amateurs like me!
  • Klen
    • Aug 2006
    • 32235

    #2
    Had same happening to me,totally copied site to three different domains.Which is why i have now in my settings protection against all kind of downloaders.

    Comment

    • roganoli
      Confirmed User
      • Dec 2004
      • 757

      #3
      I put it on my htaccess:

      RewriteCond %{HTTP_USER_AGENT} AppEngine [NC]
      RewriteRule .* - [F]

      Let me know if there is better solution...

      Comment

      • baryl
        Confirmed User
        • Aug 2011
        • 1086

        #4
        Wait..what? Can someone explain what this is?

        Comment

        • Klen
          • Aug 2006
          • 32235

          #5
          Search for article like this:
          http://www.htaccess-guide.com/blocki...-and-bad-bots/
          Had one with huge list of those downloaders,but dont know what was exact url.

          Comment

          • roganoli
            Confirmed User
            • Dec 2004
            • 757

            #6
            Originally posted by baryl
            Wait..what? Can someone explain what this is?
            Sorry if my English is not helpful...

            Comment

            • baryl
              Confirmed User
              • Aug 2011
              • 1086

              #7
              It's fine. Thanks for pointing that out and sorry about your site.

              Comment

              • RayBonga
                too cool for highschool
                • Nov 2005
                • 12164

                #8
                Originally posted by roganoli
                I put it on my htaccess:

                RewriteCond %{HTTP_USER_AGENT} AppEngine [NC]
                RewriteRule .* - [F]

                Let me know if there is better solution...
                Thanks for the warning What does this code do?

                Comment

                • brassmonkey
                  Pay It Forward
                  • Sep 2005
                  • 77396

                  #9
                  damn sorry to hear that
                  TRUMP 2026 KEKAW!!! - The Laken Riley Act Is Law!
                  DACA ENDED - SUPPORT AZ HCR 2060 52R - email: brassballz-at-techie.com

                  Comment

                  • redwhiteandblue
                    Bollocks
                    • Jun 2007
                    • 2793

                    #10
                    Originally posted by RayBonga
                    Thanks for the warning What does this code do?
                    It blocks access to any client which presents a user agent with the string "AppEngine" in it. But there are a whole load of others you need to block too, and that won't even stop all scrapers. I'd love to know the UA updowner.com is using and block that fucker.
                    Interserver unmanaged AMD Ryzen servers from $73.00

                    Comment

                    • medicatedghost
                      Confirmed User
                      • Jan 2012
                      • 209

                      #11
                      check out google.com/safebrowsing/diagnostic?site=updowner.com/
                      interesting analysis
                      Just a lowly webmaster trying to start (over) and if you could see to it that I got a fair shake from your algorithms - I'd be VERY happy to sign up with you and work my ass off producing pages that sell your product...........i can make a living with only a portion getting shaved off of my checks
                      google voice: (four-two-four) 210-7676
                      gmail: harry90717 @
                      twitter: uses gmail account
                      ICQ: 643932783

                      Comment

                      • jakeuk
                        Confirmed User
                        • Jul 2009
                        • 125

                        #12
                        The mod rewrite you suggested offer some protection. When I had similar problem a few years ago, here are other things I did in addition to the mod_rewrite:

                        1. Get strong links to your site so that your site is at least stronger than the copycat.
                        2. Rewrite some of the contents of your site so that its not a duplicate any more but a different version of the one that was copied.
                        3. Update you site regularly so that the copycat cannot keep up.
                        4. use a more aggressive exclusion robots.txt file, allowing ONLY the big 3 crawlers.

                        Comment

                        • rowan
                          Too lazy to set a custom title
                          • Mar 2002
                          • 17393

                          #13
                          Originally posted by redwhiteandblue
                          I'd love to know the UA updowner.com is using and block that fucker.
                          Any half decent scraper probably uses a common UA, and possibly a random selection of common UAs. I've seen loads of multiple pages from the same IP use a different UA each time.

                          Comment

                          Working...