php - How to block a website from crawling my site without knowing their IP address -


there's spam site exact replica of site. continuously crawl site , literally update / add content within 20 min (literally 30k+ urls). after research, i'm positive they're crawling site , storing on server.

they use cloudflare makes can't know true ip address. can somehow block them crawling site (via .htaccess or something) knowing domain name?

it's entirely possible server run crawling script separate server host clone on, if weren't using cloud flare.

however, if they're crawling content, should pretty obvious in server's access logs. if don't know are, talk hosting provider. common ip addresses listed, , try blocking them this:

order allow,deny allow deny x.x.x.x 

Comments

Popular posts from this blog

android - MPAndroidChart - How to add Annotations or images to the chart -

javascript - Add class to another page attribute using URL id - Jquery -

firefox - Where is 'webgl.osmesalib' parameter? -