Archive for February, 2010

How to filter SPAMs on Twitter efficiently : TwitteRBL ?

Sunday, February 28th, 2010

I’m working on a service using the streaming API from Twitter, a great feature as it gives you instant access to Tweets. But then you get overloaded by Tweets, and because I’m looking for Tweets talking about money, I get lots of noise.

Looking at TwitBlock, I filtered out lots of it :

  1. Ignore tweets from recent users (if created less than 24 hours ago)
  2. Ignore tweets from users with default profile image
  3. Ignore if fewer than 10 followers
  4. Ignore if user description and name are blank
  5. Ignore if followers fewer than 100, and friends count is > (2*followers_count)
  6. Ignore if followers count over 100, and friends count is > (5*followers_count)
  7. Ignore if the user sends more than 20 tweets per day in average, since its creation

Some of these is working for me, but might not work for you at all. However, I think there could be a better way for a mass use. I ran mail servers for years, a very reliable way to handle this is to use DNSBL (also called RBL). You could have different RBLs for different use, and any twitter client could implement this very easily. Please note this could probably not work for Direct Messages except if Twitter grant specific access to the service, which they would probably never will.

twitterbl

Feedbag: feed discovery with title, feeds, and proper charset

Friday, February 12th, 2010

Just released http://gemcutter.org/gems/penso-feedbag to discover feeds for specific website, with proper charset, a fork from muck-feedbag, which was a fork from feedbag.

Yes, I love Ruby, Git, Gem and GitHub.


>> require 'feedbag'
>> Feedbag.find("http://lemonde.fr")[3]
=> #<struct Feedbag::Feed url="http://www.lemonde.fr/rss/sequence/0,2-3224,1-0,0.xml", title="Le Monde.fr: Société">

J’ai franchi le pas: passage au disque SSD

Friday, February 5th, 2010

Ça fait des années que j’entends dire qu’il faudrait que je passe aux disques SSD. C’est fait pour mon MacPro avec l’achat d’un SSD Intel X25-M v2. Le résultat ?

Le temps de boot passe de 76 secondes à 26 secondes. Le lancement de Firefox avec pleins d’onglets de 12 secondes à 3 ou 4 secondes, le lancement de Safari passe à moins de une seconde.

Rapide quoi…