{"id":1310,"date":"2009-02-06T02:22:29","date_gmt":"2009-02-06T07:22:29","guid":{"rendered":"http:\/\/www.bytebot.net\/blog\/?p=1310"},"modified":"2009-02-06T01:26:14","modified_gmt":"2009-02-06T06:26:14","slug":"facebook-lexicon-the-flu-and-data-mining","status":"publish","type":"post","link":"http:\/\/www.bytebot.net\/blog\/archives\/2009\/02\/06\/facebook-lexicon-the-flu-and-data-mining","title":{"rendered":"Facebook Lexicon, the flu, and data mining"},"content":{"rendered":"<p>I recently found out about the <a href=\"http:\/\/www.facebook.com\/lexicon\/\">Facebook Lexicon<\/a>. There&#8217;s a <a href=\"http:\/\/www.facebook.com\/help.php?topic=lexicon\">FAQ<\/a>, but in a nutshell, the Lexicon tracks and counts occurrences of words and phrases on Facebook Walls (profile, group, or even event Walls) over time. Status messages don&#8217;t seem to count, though the <a href=\"http:\/\/www.facebook.com\/lexicon\/new\/\">new Lexicon<\/a> might include them in due time.<\/p>\n<p>Searched for &#8220;<a href=\"http:\/\/www.facebook.com\/lexicon\/index.php?q=the+flu\">the flu<\/a>&#8221;, only because I wanted to compare it with what you&#8217;d get from <a href=\"http:\/\/www.google.org\/flutrends\/\">Google Flu Trends<\/a>. Facebook doesn&#8217;t have the US-only limitation &#8211; it&#8217;s worldwide.<\/p>\n<p>Then I thought about Twitter search, since lots of people post updates on their lives, their feelings, and so on &#8211; look at the results there for <a href=\"http:\/\/search.twitter.com\/search?q=the+flu\">the flu<\/a>. Look at the mashup the New York Times built for the <a href=\"http:\/\/www.nytimes.com\/interactive\/2009\/02\/02\/sports\/20090202_superbowl_twitter.html\">Super Bowl on Twitter<\/a>. Are there graphing tools that track keywords? That might actually be cool.<\/p>\n<p>Lots of new ways to data mine, it seems. Google shares some <a href=\"http:\/\/www.google.org\/about\/flutrends\/download.html\">semblance of raw data<\/a>. 
Facebook doesn&#8217;t. Twitter offers only what its API exposes, which is limited (what, some 3,200 entries?).<\/p>\n<p>Imagine all this being used to predict flu clusters, or something closer to home: <a href=\"http:\/\/en.wikipedia.org\/wiki\/Dengue\">dengue<\/a> clusters. Or voter turnout (statuses saying &#8220;voted&#8221;, even).<\/p>\n","protected":false},"excerpt":{"rendered":"<p>I recently found out about the Facebook Lexicon. There&#8217;s a FAQ, but in a nutshell, the Lexicon tracks and counts occurrences of words and phrases on Facebook Walls (profile, group, or even event Walls) over time. Status messages don&#8217;t seem to count, though the new Lexicon might include them in due time. 
Searched for &#8220;the [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"jetpack_post_was_ever_published":false,"jetpack_publicize_message":"","jetpack_is_tweetstorm":false,"jetpack_publicize_feature_enabled":true,"jetpack_social_options":[]},"categories":[44],"tags":[712,297,713,714,150,715,392],"jetpack_publicize_connections":[],"jetpack_featured_media_url":"","jetpack_shortlink":"https:\/\/wp.me\/p4vJD-l8","jetpack_sharing_enabled":true,"jetpack-related-posts":[{"id":2832,"url":"http:\/\/www.bytebot.net\/blog\/archives\/2013\/11\/30\/groonga-fulltext-search-library-for-cloud-web","url_meta":{"origin":1310,"position":0},"title":"groonga &#8211; fulltext search library for cloud &#038; web","date":"30\/11\/2013","format":false,"excerpt":"This is an incomplete fragment from 2011. Figure it's worth publishing this now, considering MariaDB is likely to get groonga in the near future. The groonga team have released MariaDB 10.0.6 binaries as well. This is all part of the mroonga\u00a0project. These were my quick notes from the groonga talk\u2026","rel":"","context":"In &quot;MariaDB&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":2435,"url":"http:\/\/www.bytebot.net\/blog\/archives\/2012\/07\/21\/google-plus-is-missing-opportunities","url_meta":{"origin":1310,"position":1},"title":"Google Plus is missing opportunities","date":"21\/7\/2012","format":false,"excerpt":"When I'm in the USA, if I get the time, I do like to consume some television. I'm an odd person - I'm usually watching the advertisements more than the television shows themselves. And the promotions that surround shows. 
It's very common for advertising for products to have several logos\u2026","rel":"","context":"In &quot;General&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":2366,"url":"http:\/\/www.bytebot.net\/blog\/archives\/2012\/04\/10\/twitter-facebook-mysql-trees-online-pushing-mysql-forward","url_meta":{"origin":1310,"position":2},"title":"Twitter, Facebook MySQL trees online &#8211; pushing MySQL forward","date":"10\/4\/2012","format":false,"excerpt":"Just yesterday, I'm sure many saw Twitter open-sourcing their MySQL implementation. It is based on MySQL 5.5 and the code is on GitHub. For reference, the database team at Facebook has always been actively blogging, and keeping their code available on Launchpad. It's worth noting that the implementation there\u2026","rel":"","context":"In &quot;MySQL&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":2209,"url":"http:\/\/www.bytebot.net\/blog\/archives\/2011\/11\/13\/the-social-media-page-craze-google-facebook-twitter-linkedin","url_meta":{"origin":1310,"position":3},"title":"The Social Media Page Craze: Google+, Facebook, Twitter, LinkedIn","date":"13\/11\/2011","format":false,"excerpt":"Pages. They are becoming very popular. If you're a brand, you've got to keep track of these things. This is sort of a dump of my thoughts on this. It was quite common in the day to get a Twitter page. Multiple people can update a Twitter page. 
There are\u2026","rel":"","context":"In &quot;General&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":2323,"url":"http:\/\/www.bytebot.net\/blog\/archives\/2012\/03\/13\/special-data-plans-provide-a-mountain-to-climb-for-startups","url_meta":{"origin":1310,"position":4},"title":"Special data plans provide a mountain to climb for startups","date":"13\/3\/2012","format":false,"excerpt":"Via NYT: Days Are Numbered for Unlimited Mobile Data Plans In Indonesia, nearly a third of the population is younger than 15 years old. So Telkomsel, the leading mobile operator in the country, offers a data plan called FlexiChatting for customers who want to do just one thing: gain access\u2026","rel":"","context":"In &quot;General&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]},{"id":3199,"url":"http:\/\/www.bytebot.net\/blog\/archives\/2016\/04\/08\/tweet-summary-of-percona-live-2015","url_meta":{"origin":1310,"position":5},"title":"(tweet) Summary of Percona Live 2015","date":"8\/4\/2016","format":false,"excerpt":"The problem with Twitter is that we talk about something and before you know it, people forget. (e.g. does WebScaleSQL have an async client library?) How many blog posts are there about Percona Live Santa Clara 2015? 
This time (2016), I'm going to endeavour to write more than to just\u2026","rel":"","context":"In &quot;MySQL&quot;","img":{"alt_text":"","src":"","width":0,"height":0},"classes":[]}],"amp_enabled":true,"_links":{"self":[{"href":"http:\/\/www.bytebot.net\/blog\/wp-json\/wp\/v2\/posts\/1310"}],"collection":[{"href":"http:\/\/www.bytebot.net\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"http:\/\/www.bytebot.net\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"http:\/\/www.bytebot.net\/blog\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"http:\/\/www.bytebot.net\/blog\/wp-json\/wp\/v2\/comments?post=1310"}],"version-history":[{"count":1,"href":"http:\/\/www.bytebot.net\/blog\/wp-json\/wp\/v2\/posts\/1310\/revisions"}],"predecessor-version":[{"id":1311,"href":"http:\/\/www.bytebot.net\/blog\/wp-json\/wp\/v2\/posts\/1310\/revisions\/1311"}],"wp:attachment":[{"href":"http:\/\/www.bytebot.net\/blog\/wp-json\/wp\/v2\/media?parent=1310"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"http:\/\/www.bytebot.net\/blog\/wp-json\/wp\/v2\/categories?post=1310"},{"taxonomy":"post_tag","embeddable":true,"href":"http:\/\/www.bytebot.net\/blog\/wp-json\/wp\/v2\/tags?post=1310"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}