Shitty sites have had copyscape protected badges on them for ages.
Not sure if it still does but it used to work on 3 word samples of the html, ie you can have text and change part of every 3rd word to code and it's unique again.
Code:
This text is unique.
This text is unique.