Readers: 62 | Updated: 03-26

How Google Sets Works

Translate Into:

A tool from Google that is often overlooked is Google Sets, which allows you to “automatically create sets of items from a few examples.”

Google Sets was one of the first applications in the Google Labs pages.

Those pages are “Google’s Technology Playground,” and contain a number programs that may or may not be tomorrow’s useful applications from the search engine. As Google tells us,

Google labs showcases a few of our favorite ideas that aren’t quite ready for prime time. Your feedback can help us improve them. Please play with these prototypes and send your comments directly to the Googlers who developed them.

Google was granted a patent this week on the process behind Google Sets, and the patent document provides some details on how the program finds additional words based on “items from a set of things” that you enter.

I haven’t used Google Sets much in the past, but now that I have a sense of how it works, I might use it more often.

Since the program allows you to enter a number of items that might be members of a set, I decided to type in the names of 4 cities in Delaware:

Newark, Dover, Wilmington and Georgetown

You can then choose to get a small set, or a large set in response to the items that you chose. Here are the results that I received after picking a large set:

wilmington, georgetown, newark, dover, new castle, rehoboth beach, bear, milford, hockessin, lewes, seaford, smyrna, millsboro, middletown, claymont, milton, selbyville, townsend, laurel, harrington, felton, greenwood, clayton, magnolia, camden, wyoming, dover afb, odessa, elsmere, delaware city, bethany beach, dewey beach, ocean view, fenwick island, bridgeville, newport, montchanin, ellendale, brookside, dagsboro, millville, winterthur, saint georges, philadelphia, delmar, yorklyn, glasgow, frankford, lincoln, port penn

Most, but not all of these results, are cities in Delaware.

Not all sets received provide such good results, but if you have an idea of how Google Sets works, you may end up with better results when using the tool.

The simple explanation of how the program works is that Google attempts to identify lists on the web as it crawls pages. It may look for these lists by considering:

  • HTML tags (e.g., <UL>, <OL>, <DL>, <H1>-<H6> tags).
  • Items placed in a table,
  • Items separated by commas or semicolons,
  • Items separated by tabs.
  • Other ways.

Items typed into the Google Sets interface by users are matched up against these lists, and probabilities are calculated to determine which items might be a good match for the items submitted by someone using Google Sets.

If you keep in mind that Google Sets is suggesting additional terms for your set by considering words that might appear together in lists on Web pages, you may find the results you receive more useful.

The patent is:

System and methods for automatically creating lists
Invented by Simon Tong and Jeff Dean
Assigned to Google
US Patent 7,350,187
Granted March 25, 2008
Filed: April 30, 2003

Abstract

A system automatically creates a list from items in existing lists. The system receives one or more example items corresponding to the list and assigns weights to the items in the existing lists based on the one or more example items. The system then forms the list based on the items and the weights assigned to the items.

While the Google Sets application isn’t named in the patent filing itself, one of the images that accompany the patent is a screen shot of the front page of Google Sets.


Copyright © 2008 SEO by the SEA. This Feed is for personal non-commercial use only. If you are not reading this material in your news aggregator, the site you are looking at may be guilty of copyright infringement. Please contact SEO by the Sea, so we can take appropriate action immediately.
Plugin by Taragana

From The Blogs

Internet Observation

2007
Google惧怕Facebook的理由?
Robert Scoble有一篇题为“为什么Mahalo、TechMeme和Facebook会在四年内打败Google”的博客,其中他表明基于社会图景(如,Facebook关系)或人的(如,Mahal... 查看全文

Internet Observation

2007
网站的用户界面,等同于网站的品牌
Jerff Atwood写了一篇博客,内容是关于“Google在用户界面设计上所犯的第一大错误”。Google保守的用户界面是值得人称赞的。但是他们主页用户界面上的一部分,那个每天被登录数百万次的地方... 查看全文

Internet Observation

2007
Google是怎么做的?
Google就是权威。他们打造了一个很好的搜索引擎,并且在这七年中,始终保持领先地位。他们用AdWords重塑了在线广告,并且给予网站的每一个人有能力利用AdSense去制作buck。在这里,Gmai... 查看全文

Fashion Innovation,Digital Products

2007
34家公司进军谷歌的新开放式资源手机平台
34家公司已经联盟加入谷歌的新开放式资源手机平台"Android", T-Mobile, HTC, Samsung, LG 和 Motorola 只是其中的五家。          此联盟的宗旨是互相... 查看全文

Internet Observation

02-28
Google力压群雄当选英国第一大品牌
根据Superbrands最新的调查结果显示,Google已经超越了Microsoft和BBC,当选英国第一大品牌。     附上一些有趣的数据观察:Google是榜单上唯一一家1990年后才起步创业... 查看全文

Geek About

01-15
Top 17 Most Bizarre Sights on Google Earth
Satellite imagery used to be the exclusive domain of governments and spy agencies, but ever since Google Maps and Google Earth we can all get to see weird things! Fancy a look at Area 51? Wondered wha... 查看全文

World,Fashion, Entertainment

03-06
掌控好你们的搜索结果吧
一直以来,令我惊讶的是,一些高姿态的网站无法控制自己的搜索结果。看看下面的截图吧,"飞利浦剃须刀" ,你可以看到,要想跳过这10条紧密相关的赞助名单、3条产品的搜索结果和3条相关机构的搜索结果,几乎是... 查看全文

LifeReboot.com

04-13
My Interview at Google
I sat in the waiting room with one other applicant.He was older than me by about ten years.Judging by our clothes, it was clear that we were taking different approaches to this once-in-a-lifetime oppo... 查看全文

IntoMobile » Platforms

04-11
Opera Mini ported to Google's Android
Opera Mini, probably the best piece of mobile software even written in the past few years, I strongly encourage you to check out Beta 4.1, has been ported to Googles Android operating system. The proc... 查看全文

Internet Observation

2007
Google将携手移动电话行业
Google宣布,它将初次涉足无线电市场,并将与手机运营商、手机制造商以及软件开发商合作,创造一个全新的移动电话领域。这将是Google构建其广告帝国的重要一步。与苹果的iphone不同的是,Goog... 查看全文
More Articles