Skip to main content

How do search engine works


By Vinod Kumar ( 26/01/2018 )


For many, Google is the internet. It's the starting point for finding new sites, and is arguably the most important invention since the internet itself. Without search engines, new web content would be inaccessible to the masses.

But do you know how search engines work?
Every search engine has three main functions: crawling (to discover content), indexing (to track and store content), and retrieval (to fetch relevant content when users query the search engine).


Crawling

Crawling is where it all begins: the acquisition of data about a website.
This involves scanning sites and collecting details about each page: titles, images, keywords, other linked pages, etc. Different crawlers may also look for different details, like page layouts, where advertisements are placed, whether links are crammed in, etc.

But how is a website crawled? An automated bot (called a "spider") visits page after page as quickly as possible, using page links to find where to go next. Even in the earliest days, Google's spiders could read several hundred pages per second. Nowadays, it's in the thousands.
When a web crawler visits a page, it collects every link on the page and adds them to its list of next pages to visit. It goes to the next page in its list, collects the links on that page, and repeats. Web crawlers also revisit past pages once in a while to see if any changes happened.
This means any site that's linked from an indexed site will eventually be crawled. Some sites are crawled more frequently, and some are crawled to greater depths


Indexing

Indexing is when the data from a crawl is processed and placed in a database.
Imagine making a list of all the books you own, their publishers, their authors, their genres, their page counts, etc. Crawling is when you comb through each book while indexing is when you log them to your list.
Now imagine it's not just a room full of books, but every library in the world.


Retrieval and Ranking

Retrieval is when the search engine processes your search query and returns the most relevant pages that match your query.
Most search engines differentiate themselves through their retrieval methods: they use different criteria to pick and choose which pages fit best with what you want to find. That's why search results vary between Google and Bing, 
Ranking algorithms check your search query against billions of pages to determine each one's relevance. Companies guard their ranking algorithms as patented industry secrets due to their complexity. A better algorithm translates to a better search experience.
Search engine exploitation is possible, of course, but isn't so easy anymore.
Originally, search engines ranked sites by how often keywords appeared on a page, which led to "keyword stuffing" --- filling pages with keyword-heavy nonsense.
Then came the concept of link importance: search engines valued sites with lots of incoming links because they interpreted site popularity as relevance. But this led to link spamming all over the web. Nowadays, search engines weight links depending on the "authority" of the linking site. Search engines put more value on links from a government agency than links from a link directory.


For more tech videos, Subscribe to our channel

Comments

Techvinu said…
This comment has been removed by the author.

Popular posts from this blog

      Voting machine using Arduino We all are quite familiar with voting machines, even we have covered few other electronic voting machine projects previously aan  using RFID and AVR microcontroller. In this project, we have used the arduino controller to create an  electronic voting machine Electronic voting machine has now replaced the traditional mechanism of ballot voting due to several advantages like security, automatic counting etc. The system consists of two units – the control unit and the user unit. The control unit consists of some control switches and status LED’s, and is handled by the presiding officer. The user unit provides voting facility and contains a matrix keypad, a memory IC and an LCD display. The system operates in three modes – the Idle mode, Voting mode and Counting mode. Each mode is identified by a status byte written in the EEPROM. In Ideal mode the machine is idle, that means the machine is ready to use. When the presiding officer press the ST
Aravinda sametha review Movie starts with sunil Aravindha Sametha show started with a flash back episode of faction story in Kommaddi. Sunil takes us back to the flash back. First movie started with rayalasima backdrop Hero intro started After 10 min flight scene started which was leaked previously  First fight and Tarak’s shirtless entry fight has enough high moments for fans, but has a high dose of violence. Just now jadapathi babu introduced as villain  Nagababu is a father of ntr  It's very emotion scene with nagababu who was died at fight scene First song is ekonalo cherinado   In “ Ram Rudhiram “ ( pathos song ) taking , trivikram has shown his philosophical side First twenty minutes runs on a highly emotional note. Veera Raghava’s (NTR) entry followed by a high action block and ‘Yeda Poyinado’ song Naga Babu son, Veera (NTR) comes to his village. Twist to Naga Babu’s character. NTR’s stylish look, Mass Story, High Octane Actio
 100 things Google announced at I/O'19 Another I/O is in the books! We played in sandboxes, watched eye-popping product demos and  listened to AI-powered music . But the fun isn’t over! In case you missed it, here are 100 announcements we made at I/O: Hardware 1.  Hold the phone! Our new smartphones— the Pixel 3a and Pixel 3a XL —hit the shelves this week, bringing together all the essential Google features at a lower price ($399 for the 5.6-inch display and $479 for the 6-inch model). 2.  Good things come in threes, like Pixel 3a’s color options. Choose from Purple-ish, Clearly White and Just Black. 3.  And no matter what color your phone is, it has the same great Pixel camera. Capture shots in portrait mode and HDR+, or use Night Sight to take magical photos in low light (think outdoor concerts, swanky restaurants or night hikes with friends). 4.  To add to the creativity, Time Lapse is coming to Pixel 3a. Soon you can capture an entire sunset within a fe