Quote:
Originally Posted by borked
and yes, I've sped things up by 1000x
|
That's much better but still not good enough.
If on a much better server your script can go up to comparing 10mil pics/hour, that means you can run 10 comparisons/hour against a database of one million pics (pretty average db, many programs have that much and often more). That means ~250 comparisons/day - practically, that means you can check about 1 thread/day at pornbb or similar major boards. Or, in terms of posts, that's about 100 posts/ day (an average post has more than one preview attached).
To give you some idea of the size of the task, major boards like pornbb or saff boast 5-10K posts/day. Granted, not all of them contain any pictures to analyze (most of them are just "thanks"), but it is safe to assume that no less than 1K posts/day will contain some graphics to compare with the database.
That requires 10x more computing power than we have according to our best estimation. And you need to control at least a dozen of the major boards to make sure your stuff is not easy to find, hence 100x faster script is necessary. Which could do about 1 billion comparisons/hour.
I'm talking boards only because that's where most of the picture piracy takes place. Tubes and torrents mostly steal videos (although photo content is not uncommon for torrents too, boards host much more of it).