Topic: [Original] Google In My Eyes (1) -- 林木森森

Google is definitely the king of the search market right now; it has won. First of all, I would like to show you some data to give you an overall picture:

1. at least 1,000 queries/sec

2. 200 million searches/day

3. nearly 100,000 servers

4. an index of 3 billion web pages

5. 4 petabytes of disk storage

The sheer volume of data is astonishing, and its impact on the 21st-century information age is profound.
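To get a feel for these numbers, here is a rough back-of-the-envelope sketch in Python. It only uses the figures quoted above; the variable names are mine, the petabyte is taken as 10^15 bytes, and load is assumed to be spread evenly over the day and over the servers, which real traffic certainly is not.

```python
# Back-of-the-envelope arithmetic on the figures quoted in the post.
# Assumptions (mine, not the post's): even load, 1 PB = 10**15 bytes.
SECONDS_PER_DAY = 24 * 60 * 60

searches_per_day = 200_000_000      # "200 million searches/day"
servers = 100_000                   # "nearly 100,000 servers"
pages_indexed = 3_000_000_000       # "3 billion web pages"
disk_bytes = 4 * 10**15             # "4 petabytes of disk storage"

# Average query rate if searches were spread evenly over 24 hours.
avg_qps = searches_per_day / SECONDS_PER_DAY

# Average number of searches each server would see per day.
searches_per_server_per_day = searches_per_day / servers

# Total disk divided by indexed pages (disk holds more than the index,
# so this is only an upper bound on storage per page).
bytes_per_page = disk_bytes / pages_indexed

print(f"average queries/sec: {avg_qps:.0f}")
print(f"searches per server per day: {searches_per_server_per_day:.0f}")
print(f"disk bytes per indexed page: {bytes_per_page:.0f}")
```

The daily total works out to an average of a bit over 2,300 queries per second, which fits with the "at least 1,000 queries/sec" figure.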

The Internet started a revolution, and search will be essential to our society in the future. Here are some things about Google:

1. Its search technology. Several things determine whether a search returns quality results: the size of the index, the algorithm that ranks relevancy based on how the query is interpreted, etc. The secret ingredient here is Google's PageRank algorithm. Actually, link analysis isn't a breakthrough; it came from an IBM research lab first. To put this computer jargon in plain English, it is like a popularity contest: if a web site has the most links from other relevant sites, it is considered popular. I think it has its merits to some extent. So far Google does a pretty decent job on relevancy, but better is yet to come. Of course, a bunch of people are trying hard to game its algorithm to get to the top of the rankings; bogus link farms are one such tactic. On the current playing field there are other upstarts willing to challenge for the crown too, such as Teoma, which is pretty impressive to me. The search market is only going to get hotter.

2. Its infrastructure. Being responsive is always the top priority in the search business. Google goes its own way: it stacks inexpensive servers on racks and writes software to manage them. At its volume and traffic there is no off-the-shelf solution, so it rolled out its own distributed file system, automated management system, cluster system, etc. Its infrastructure is unbelievable; just imagine dealing with those 100,000+ servers, it would make people dizzy. So they put roughly 110% of the servers they need into service; if about 10% are down, the rest still function.

3. Its culture. It pursues perfection. The simplicity of its user interface is definitely attractive compared with those cluttered commercial advertising directories. It does one thing, but does it really well. No bull at all. It is serious about what it is doing. It does things its own way, like its IPO process. Still, there are some things about its style that make me uneasy. Take Gmail: why does it have to start with a selected “small group”? Where is the fairness in that? The same goes for its social networking baby, “orkut”: why did it start with 2,000 people picked at random from around the world? I guess they are just being different.
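Going back to the "popularity contest" idea in point 1, here is a minimal sketch of how link-based ranking can work. It is a textbook-style power iteration, not Google's actual implementation; the function name `pagerank`, the toy link graph, and the damping value of 0.85 (a commonly cited default) are all my own choices for illustration.

```python
def pagerank(links, damping=0.85, iterations=50):
    """Toy link-popularity ranking by repeated score passing.

    links maps each page to the list of pages it links to.
    Returns a dict of page -> score; scores sum to 1.
    """
    # Collect every page that appears as a source or a target.
    pages = set(links)
    for targets in links.values():
        pages.update(targets)
    pages = sorted(pages)
    n = len(pages)

    # Start with every page equally popular.
    rank = {p: 1.0 / n for p in pages}

    for _ in range(iterations):
        # Every page gets a small baseline score.
        new_rank = {p: (1.0 - damping) / n for p in pages}
        for page in pages:
            targets = links.get(page, [])
            if targets:
                # A page splits its score among the pages it links to.
                share = damping * rank[page] / len(targets)
                for target in targets:
                    new_rank[target] += share
            else:
                # A page with no outgoing links spreads its score evenly.
                share = damping * rank[page] / n
                for p in pages:
                    new_rank[p] += share
        rank = new_rank
    return rank


# Hypothetical four-page web: a, b and d all link to c; c links back to a.
toy_web = {
    "a": ["c"],
    "b": ["c"],
    "c": ["a"],
    "d": ["c"],
}

ranks = pagerank(toy_web)
for page in sorted(ranks, key=ranks.get, reverse=True):
    print(page, round(ranks[page], 3))
```

In this toy web, page c comes out on top because three pages link to it, and a beats b and d because its one incoming link comes from the popular page c. That is also why link farms work as an attack: they manufacture incoming links.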


Copyright © cchere 西西河