Short answer
In short, they developed a web crawler designed to crawl the web very efficiently across many machines (it can also run on a single computer).
, .
, , , .
It builds on Hadoop, a Java project that implements MapReduce. MapReduce is a programming model introduced by Google.
MapReduce/Hadoop, , , , - ( ).
For more detail, see the Wikipedia article on MapReduce.
, Node, () ( ), , .
, Node - ( -), ( ) .
, , .
There are 4 main steps:
*
-, , : "-".
Node ( , ).
- :
- ? , .
- It normalizes URLs, so that "http://www.google.com/" and "http://www.google.com/../" are treated as the same web page.
- - .
( -, , )
topN (, 10 ) -.
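As a rough illustration of why "http://www.google.com/" and "http://www.google.com/../" can be treated as one page, here is a minimal Python sketch of URL normalization. It is not the crawler's own code: the function name `normalize_url` and the exact rules (lowercase host, collapse dot segments, drop the fragment) are assumptions chosen for the example.

```python
import posixpath
from urllib.parse import urlsplit, urlunsplit


def normalize_url(url):
    """Return a canonical form of url (hypothetical helper, for illustration)."""
    parts = urlsplit(url)
    # An empty path means the site root.
    path = posixpath.normpath(parts.path) if parts.path else "/"
    # normpath drops a trailing slash; restore it if the input had one.
    if parts.path.endswith("/") and not path.endswith("/"):
        path += "/"
    # Lowercase scheme and host, keep the query, drop the fragment.
    return urlunsplit(
        (parts.scheme.lower(), parts.netloc.lower(), path, parts.query, "")
    )


if __name__ == "__main__":
    a = normalize_url("http://www.google.com/")
    b = normalize_url("http://www.google.com/../")
    print(a == b)
```

With both URLs reduced to the same string, a crawler can store them under one key and avoid fetching the page twice.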
* Fetch
URL- -, , .
Slaves URL , -, .
URL- HTML- - .
*
- -, , .
, , .
- -.
*
- -, .
URL-, ( ) URL- ( - ).
- .
.
Repeat
, - -. , , . .
MapReduce - .
, .
, , . .
, , . .
A MapReduce job has three parts:
Mapper, Partitioner, and Reducer.
MapReduce , , kila. (, Mega-).
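To make the roles of Mapper, Partitioner and Reducer concrete, here is a small in-memory Python sketch. It only imitates the data flow on one machine and does not use Hadoop; the job itself (counting URLs per host), the function names, and the CRC32-based partitioning are all assumptions made for the example.

```python
import zlib
from collections import defaultdict
from urllib.parse import urlsplit


def mapper(url):
    """Emit (key, value) pairs; here the key is the URL's host."""
    yield urlsplit(url).netloc.lower(), url


def partitioner(key, num_partitions):
    """Choose which partition receives every pair with this key."""
    return zlib.crc32(key.encode("utf-8")) % num_partitions


def reducer(key, values):
    """Combine all values for one key; here, count the URLs per host."""
    return key, len(values)


def run_job(urls, num_partitions=2):
    # Map phase: route each emitted pair to its partition, grouped by key.
    partitions = [defaultdict(list) for _ in range(num_partitions)]
    for url in urls:
        for key, value in mapper(url):
            partitions[partitioner(key, num_partitions)][key].append(value)
    # Reduce phase: each partition is processed independently.
    results = {}
    for partition in partitions:
        for key, values in partition.items():
            out_key, out_value = reducer(key, values)
            results[out_key] = out_value
    return results


if __name__ == "__main__":
    print(run_job(["http://a.com/1", "http://a.com/2", "http://b.org/"]))
```

Because the partitioner sends every pair with the same key to the same partition, each reducer sees all values for its keys, which is what lets the partitions run on separate machines.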