<body><script type="text/javascript"> function setAttributeOnload(object, attribute, val) { if(window.addEventListener) { window.addEventListener('load', function(){ object[attribute] = val; }, false); } else { window.attachEvent('onload', function(){ object[attribute] = val; }); } } </script> <div id="navbar-iframe-container"></div> <script type="text/javascript" src="https://apis.google.com/js/plusone.js"></script> <script type="text/javascript"> gapi.load("gapi.iframes:gapi.iframes.style.bubble", function() { if (gapi.iframes && gapi.iframes.getContext) { gapi.iframes.getContext().openChild({ url: 'https://www.blogger.com/navbar.g?targetBlogID\x3d6418452\x26blogName\x3dFootsteps+on+Clouds\x26publishMode\x3dPUBLISH_MODE_BLOGSPOT\x26navbarType\x3dBLACK\x26layoutType\x3dCLASSIC\x26searchRoot\x3dhttp://chirayu.blogspot.com/search\x26blogLocale\x3den_US\x26v\x3d2\x26homepageUrl\x3dhttp://chirayu.blogspot.com/\x26vt\x3d7754879049997020549', where: document.getElementById("navbar-iframe-container"), id: "navbar-iframe" }); } }); </script>

Thursday, September 28, 2006

Reading


- Information retrieval based on historical data
- Google Patent Application
- SEO by the Sea
- IPLists

Papers on Spam in search engine results (PDFs)

- Paper on detecting spam
- Paper on finding spam

What is the size of English language content on the World Wide Web?

Search for a common English word that is likely to be found on any Web page. Examples: the, of, to, and, a, in, is, it, you, or that.
The average of the number of Web pages found for these searches comes to ~12.5bn.
Add Web pages in other languages, images, news, groups, blogs, rss feeds, atom feeds, maps, local listings, directories, and other multimedia content and as a rough estimate the total size of searchable content on the Internet would be ~18bn-19bn documents, that's ~20 times the population of India!