Google Image Search could be used in many ways.
1. If you want to know if a person is a man or a woman and the name doesn’t help, do a search for the name.
2. If you don’t know the meaning of a word, the pictures may help you.
3. A better search for Flickr. Google uses information from other sites that link to Flickr photos, so you may find Google search better.
4. Find what’s interesting about a site, by looking at the pictures included. For example: wired.com.
5. Find a new wallpaper for your desktop by restricting your search to large images. You can automate this using an application.
6. Find random personal pictures, using standard file names from digital cameras.
7. Type the name of a painter and you can take an art class.
8. Install a Greasemonkey script so you can view the original version of the image directly by clicking on the thumbnail.
9. Find the color of a word. “Word Color is a windows program that uses Google Image Search to determine the color of a word or string of words. It goes out there, retrieves the top 9 images and loops through all pixels, calculating the average hue, which is later converted to a color.”
10. If you want to grab search results, GoogleGrab is a tool that downloads images from Google Image Search. It even supports batch search.
A robots.txt file is a short text file that resides in your home directory. Before search engines spider your site, they look into this file to see which files/file types and/or directories they are not allowed to see. This tool will make your robots.txt file. As a little ‘extra’ it also includes a block to many unwanted spiders that only crawl your site in order to collect the email addresses stored on your pages.(SPAM-bots)
Allowing all bots
Your basic robots.txt file (what you likely want to use if you haven’t been using one until now) will allow all bots to index all pages of your site. The * is generic to refer to all bots.
User-agent: * Disallow:
Disallowing Specific Pages & Directories
You may want to not allow bots to index certain pages or directories on your site. Do keep in mind that if you have any private directories that are listed in your robots.txt that you should password protect them. Otherwise you have just alerted rogue bots (that ignore contents of your robots.txt file or specifically target disallowed pages and directories) as well as any competitors to your private directories. In these cases, you should disallow robots using meta tags instead.
cgi-bin is a common directory to disallow, since it normally contains scripts, many that don’t need indexing and which can go through a lot of bandwidth as a result.
User-agent: * Disallow: /cgi-bin/ Disallow: /privatedirectory/ Disallow: /directory/largepage.htm
Once it is completed, save it as robots.txt and upload it to your server.
http://www.example.com/robots.txt