Service that kills mirrors and scrapers

Collapse
X
 
  • Time
  • Show
Clear All
new posts
  • TrafficTitan
    Confirmed User
    • Nov 2012
    • 350

    #1

    Service that kills mirrors and scrapers

    Is there an affordable online company that kills people who are doing complete site copies and are scraping content to displace you from google search results?
    https://www.hentaicity.com/ | http://www.traffictitan.com/
  • sarettah
    see you later, I'm gone
    • Oct 2002
    • 14301

    #2
    Originally posted by TrafficTitan
    Is there an affordable online company that kills people who are doing complete site copies and are scraping content to displace you from google search results?
    I know a guy that says he will do that. Usually wants cash in small unmarked bills though.

    .
    All cookies cleared!

    Comment

    • pimpmaster9000
      Too lazy to set a custom title
      • Dec 2011
      • 26732

      #3
      Affordable and kill is a bad idea IMO...
      Report a suspicious cracker: Click Here

      Comment

      • TrafficTitan
        Confirmed User
        • Nov 2012
        • 350

        #4
        Kills the sites, not the person lol
        https://www.hentaicity.com/ | http://www.traffictitan.com/

        Comment

        • sarettah
          see you later, I'm gone
          • Oct 2002
          • 14301

          #5
          Originally posted by TrafficTitan
          Kills the sites, not the person lol
          Oh, that's a whole different story.



          .
          All cookies cleared!

          Comment

          • bronco67
            Too lazy to set a custom title
            • Dec 2006
            • 29032

            #6
            There's always going to be thieving motherfuckers and the internet just makes it 100 times easier to get away with.

            Comment

            • Bladewire
              StraightBro
              • Aug 2003
              • 56228

              #7
              Originally posted by TrafficTitan
              Is there an affordable online company that kills people who are doing complete site copies and are scraping content to displace you from google search results?
              You can protect your sites in .htaccess from scrapers. Are your sites on WP installs?

              Do you want me to paste some code here for you?


              Skype: CallTomNow

              Comment

              • Barry-xlovecam
                It's 42
                • Jun 2010
                • 18083

                #8
                It's really cat and mouse. UFW or iptables -- firewall them out -- if you have root. However, they will change IPs or AS networking so it is a never ending game.

                Comment

                • sarettah
                  see you later, I'm gone
                  • Oct 2002
                  • 14301

                  #9
                  Cloudflare can protect against some of the scrapers out there but just like anything, they can be beaten too.

                  .
                  All cookies cleared!

                  Comment

                  • rowan
                    Too lazy to set a custom title
                    • Mar 2002
                    • 17393

                    #10
                    Originally posted by Barry-xlovecam
                    It's really cat and mouse. UFW or iptables -- firewall them out -- if you have root. However, they will change IPs or AS networking so it is a never ending game.
                    I have a site that's scraped to hell and back. If you exclude Googlebot and all of the scrapers, there's probably less than 2% remaining (loads by a browser).

                    Over the years I've added bits and pieces to log various interesting information. The big red flag that sticks out, at least for my site: scrapers use proxies, so their IPs can change without notice, but the headers they send are usually a fixed pattern that is nothing like a real browser, so they're super easy to block.

                    Even a simple CAPTCHA that is triggered after say 10 loads without presenting a cookie manages to block most of them. Some IPs are constantly bashing at the site, day after day, even though they are almost perpetually 403'd or firewalled.

                    Guess there is a market for a service like this, if one doesn't exist... but integrating it into a customer's existing site would be interesting...

                    Comment

                    • Barry-xlovecam
                      It's 42
                      • Jun 2010
                      • 18083

                      #11
                      Originally posted by sarettah
                      Cloudflare can protect against some of the scrapers out there but just like anything, they can be beaten too.

                      .
                      Any other questions

                      Code:
                      </html>barry@paragon-DS-7:/media/barry/ADATA3$ dig NS gfy.com
                      
                      ; <<>> DiG 9.10.3-P4-Ubuntu <<>> NS gfy.com
                      ;; global options: +cmd
                      ;; Got answer:
                      ;; ->>HEADER<<- opcode: QUERY, status: NOERROR, id: 21089
                      ;; flags: qr rd ra; QUERY: 1, ANSWER: 2, AUTHORITY: 0, ADDITIONAL: 1
                      
                      ;; OPT PSEUDOSECTION:
                      ; EDNS: version: 0, flags:; udp: 512
                      ;; QUESTION SECTION:
                      ;gfy.com.			IN	NS
                      
                      ;; ANSWER SECTION:
                      gfy.com.		86399	IN	NS	june.ns.cloudflare.com.
                      gfy.com.		86399	IN	NS	theo.ns.cloudflare.com.
                      
                      ;; Query time: 56 msec
                      ;; SERVER: 127.0.0.1#53(127.0.0.1)
                      ;; WHEN: Sat Sep 02 12:42:15 EDT 2017
                      ;; MSG SIZE  rcvd: 88
                      =======================
                      
                      barry@paragon-DS-7:/media/barry/ADATA3$ curl -A "Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/59.0.3071.115 (Official Build) (64-bit)" "http://gfy.com/"
                      <!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Transitional//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd">
                      <html xmlns="http://www.w3.org/1999/xhtml" dir="ltr" lang="en">
                      <head>
                      	<!-- no cache headers -->
                      	<meta http-equiv="Pragma" content="no-cache" />
                      	<meta http-equiv="Expires" content="-1" />
                      	<meta http-equiv="Cache-Control" content="no-cache" />
                      	<!-- end no cache headers -->
                      
                       	<meta name="robots" content="noarchive">
                      	
                      	<title>GFY Webmaster Board</title>

                      Comment

                      • freecartoonporn
                        Confirmed User
                        • Jan 2012
                        • 7683

                        #12
                        Originally posted by Barry-xlovecam
                        Any other questions

                        Code:
                        </html>barry@paragon-DS-7:/media/barry/ADATA3$ dig NS gfy.com
                        
                        ; <<>> DiG 9.10.3-P4-Ubuntu <<>> NS gfy.com
                        ;; global options: +cmd
                        ;; Got answer:
                        ;; ->>HEADER<<- opcode: QUERY, status: NOERROR, id: 21089
                        ;; flags: qr rd ra; QUERY: 1, ANSWER: 2, AUTHORITY: 0, ADDITIONAL: 1
                        
                        ;; OPT PSEUDOSECTION:
                        ; EDNS: version: 0, flags:; udp: 512
                        ;; QUESTION SECTION:
                        ;gfy.com.			IN	NS
                        
                        ;; ANSWER SECTION:
                        gfy.com.		86399	IN	NS	june.ns.cloudflare.com.
                        gfy.com.		86399	IN	NS	theo.ns.cloudflare.com.
                        
                        ;; Query time: 56 msec
                        ;; SERVER: 127.0.0.1#53(127.0.0.1)
                        ;; WHEN: Sat Sep 02 12:42:15 EDT 2017
                        ;; MSG SIZE  rcvd: 88
                        =======================
                        
                        barry@paragon-DS-7:/media/barry/ADATA3$ curl -A "Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/59.0.3071.115 (Official Build) (64-bit)" "http://gfy.com/"
                        <!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Transitional//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd">
                        <html xmlns="http://www.w3.org/1999/xhtml" dir="ltr" lang="en">
                        <head>
                        	<!-- no cache headers -->
                        	<meta http-equiv="Pragma" content="no-cache" />
                        	<meta http-equiv="Expires" content="-1" />
                        	<meta http-equiv="Cache-Control" content="no-cache" />
                        	<!-- end no cache headers -->
                        
                         	<meta name="robots" content="noarchive">
                        	
                        	<title>GFY Webmaster Board</title>
                        what are you trying to show ?
                        SSD Cloud Server, VPS Server, Simple Cloud Hosting | DigitalOcean

                        Comment

                        • Barry-xlovecam
                          It's 42
                          • Jun 2010
                          • 18083

                          #13
                          That cloudflare is a submarine with screen doors?

                          Comment

                          • Bladewire
                            StraightBro
                            • Aug 2003
                            • 56228

                            #14
                            They need to tweak their cloudflare setting is all

                            Not sure if they changed their old IP afternoon cloudflare, as I suggested before, if not they're still wide open.

                            Edit: nope they didn't change it and they have 64 hostnames pointing to that same IP :/

                            Step 10 here - change IP after setup


                            Skype: CallTomNow

                            Comment

                            • Barry-xlovecam
                              It's 42
                              • Jun 2010
                              • 18083

                              #15
                              GFY IS using a Cloudflare IP

                              Luckily botnets are made of stupid computer users. Maybe, Cloudflare can control ddos spikes in traffic.

                              Cloudflare will not stop lone wolf hackers or scrapers. I just repeated that curl successfully from a Digital Ocean server I have in Amsterdam. So, the AS block is not checked at Cloudflare either.

                              Comment

                              • Bladewire
                                StraightBro
                                • Aug 2003
                                • 56228

                                #16
                                Originally posted by Barry-xlovecam
                                GFY IS using a Cloudflare IP

                                Luckily botnets are made of stupid computer users. Maybe, Cloudflare can control ddos spikes in traffic.

                                Cloudflare will not stop lone wolf hackers or scrapers. I just repeated that curl successfully from a Digital Ocean server I have in Amsterdam. So, the AS block is not checked at Cloudflare either.
                                You dope.

                                The GFY server IP is the same as when it was first attacked.

                                Then GFY hid behind cloudflare, with the same IP, and continued to go down.

                                So clearly the only reason the attack stopped was because the attacker stopped hitting the original IP.

                                I pay for cloudflare business , and use cloudflare. And I looked up GFY's IP.

                                I just PM'd it to you


                                Skype: CallTomNow

                                Comment

                                • freecartoonporn
                                  Confirmed User
                                  • Jan 2012
                                  • 7683

                                  #17
                                  Originally posted by Barry-xlovecam
                                  Any other questions

                                  Code:
                                  </html>barry@paragon-DS-7:/media/barry/ADATA3$ dig NS gfy.com
                                  
                                  ; <<>> DiG 9.10.3-P4-Ubuntu <<>> NS gfy.com
                                  ;; global options: +cmd
                                  ;; Got answer:
                                  ;; ->>HEADER<<- opcode: QUERY, status: NOERROR, id: 21089
                                  ;; flags: qr rd ra; QUERY: 1, ANSWER: 2, AUTHORITY: 0, ADDITIONAL: 1
                                  
                                  ;; OPT PSEUDOSECTION:
                                  ; EDNS: version: 0, flags:; udp: 512
                                  ;; QUESTION SECTION:
                                  ;gfy.com.			IN	NS
                                  
                                  ;; ANSWER SECTION:
                                  gfy.com.		86399	IN	NS	june.ns.cloudflare.com.
                                  gfy.com.		86399	IN	NS	theo.ns.cloudflare.com.
                                  
                                  ;; Query time: 56 msec
                                  ;; SERVER: 127.0.0.1#53(127.0.0.1)
                                  ;; WHEN: Sat Sep 02 12:42:15 EDT 2017
                                  ;; MSG SIZE  rcvd: 88
                                  =======================
                                  
                                  barry@paragon-DS-7:/media/barry/ADATA3$ curl -A "Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/59.0.3071.115 (Official Build) (64-bit)" "http://gfy.com/"
                                  <!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Transitional//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd">
                                  <html xmlns="http://www.w3.org/1999/xhtml" dir="ltr" lang="en">
                                  <head>
                                  	<!-- no cache headers -->
                                  	<meta http-equiv="Pragma" content="no-cache" />
                                  	<meta http-equiv="Expires" content="-1" />
                                  	<meta http-equiv="Cache-Control" content="no-cache" />
                                  	<!-- end no cache headers -->
                                  
                                   	<meta name="robots" content="noarchive">
                                  	
                                  	<title>GFY Webmaster Board</title>
                                  i still cant figure out gfy ip from that, what am i missing ?


                                  thanks
                                  SSD Cloud Server, VPS Server, Simple Cloud Hosting | DigitalOcean

                                  Comment

                                  • Barry-xlovecam
                                    It's 42
                                    • Jun 2010
                                    • 18083

                                    #18
                                    BW I don't give a fuck how ...
                                    PM Eric and tell what DNS server that needs to have its records fixed.

                                    Comment

                                    Working...