Semantics and the Future of the Desktop

August 19th, 2008

Nova Spivak, the CEO of Twine, has written another very thoughtful article about the future of Web apps and what he calls “Webtops,” a combination of the desktop and the Web.

Read the article here.

I appreciate this article because it paints a great vision of what Cognition’s semantic technology can do in the future of Web-based applications.  Integrating semantic search, plus a proactive semantic filtering to deliver more people, places and things of interest to the user, represents a great win for computer users and for Cognition, as our Semantic NLP and Semantic Map can be at the core of these new Web services. 

Check it out and let us know what you think!

-TB

Nice blog found for medical research

August 15th, 2008

I just found this blog called “the health informaticist” and if you’re a medical/health researcher the author has some good info.  He even gave a shout out to Cognition’s SemanticMEDLINE here.  Thanks, Alan!

-TB

How Web Technology is Boosting Productivity in Organizations

August 14th, 2008

ReadWriteWeb.com has written a great article about web technology boosting productivity here.  In the article, ReadWriteWeb’s editor, Richard MacManus, makes a very nice mention about Cognition Technologies:

“The other way startups are adding innovation to the enterprise is through new types of products that are tackling problems such as information overload. An example that we’ve been wanting to mention for a while now is Cognition Technologies, which is licensing its semantic web technologies to various organizations. Cognition’s Natural Language Processing (NLP) technology is being used for a number of different applications, from health to law. While not so much enterprise at this time, it’s easy to see how Cognition’s technology could be used to filter information where ever there is a large data set - which is many businesses these days.”

Read the entire ReadWriteWeb article here.

 

Cognition’s August Newsletter…”Inside the Mind”

August 6th, 2008

It’s here!  It’s hot off the press.  Check out the latest (and first) issue of “Inside the Mind”.

The Semantic Web’s Tagging System Already Exists – And It’s Not RDF or OWL

August 5th, 2008

By Dr. Kathleen Dahlgren, CTO

 

Over the course of the several years, there has been significant discussion on how to move the Web towards the vision of Web 3.0 (the Semantic Web).  At the time, it appeared that the only course of action was to develop a common tagging language (RDF and OWL) and then attempt to impose these standards of content creators on the Web.  Granted, there weren’t many other alternatives to this approach at the time and it appeared that a “brute force” approach to the problem was the only way to get the process moving.  That was then – this is now:  RDF and OWL are insufficient to meet the needs of the Semantic Web (Web 3.0).  There are much better alternatives today, many of which have only recently made themselves known (e.g. Powerset, Hakia, Cognition Technologies, etc.), which employ new semantic technologies and capabilities which render the need for tagging obsolete.  Tagging is unnecessary for Natural Language Processing (NLP) systems, extremely labor intensive, it requires a broad consensus amongst users and content creators, and is unenforceable.  The way the Web becomes semantic is to employ the only foundational standard necessary — the English language.

 

First, let’s define the terms:  The commonly-known Semantic Web creates hierarchical relationships and descriptions of Web using a special markup language in XML.  The initial language used for this was RDF (Resource Description Framework,) which was later augmented with a higher level language OWL (Web Ontology Language).  An example of an RDF tag as applied to the description of a music CD:

 

<?xml version=”1.0″?>
 
<rdf:RDF
xmlns:rdf=”http://www.w3.org/1999/02/22-rdf-syntax-ns#” 
xmlns:cd=”http://www.recshop.fake/cd#”> 
<rdf:Description
 rdf:about=”http://www.recshop.fake/cd/Empire Burlesque”>
  <cd:artist>Bob Dylan</cd:artist>
  <cd:country>USA</cd:country>
  <cd:company>Columbia</cd:company>
  <cd:price>10.90</cd:price>
  <cd:year>1985</cd:year>
</rdf:Description>
<rdf:Description
 rdf:about=”http://www.recshop.fake/cd/Hide your heart”>
  <cd:artist>Bonnie Tyler</cd:artist>
  <cd:country>UK</cd:country>
  <cd:company>CBS Records</cd:company>
  <cd:price>9.90</cd:price>
  <cd:year>1988</cd:year>
</rdf:Description>
</rdf:RDF>   

 

As you can see, information about the page is marked up in a formal language, indicating the artist, country, company, price etc. of the CD.  Such mark-up is a form of document tagging.  

 

The complication isn’t just in the fact that someone has to create these tags for all Web content.  A more significant issue is that to find documents by these tags, users have to know what the system of tags are. To the extent that all coders of RDF or OWL can coordinate to use the same system of tags, then the tagged pages could become accessible to a wide spectrum of Internet users.  A typical way of accessing tags is to offer users a menu of choices for values of the tags.  Unfortunately, this creates more work and inefficiency.  Also, it injects a certain amount of subjectivity into to the process because the content creator or manager is the arbiter of which tags will or will not be used.  As a result, if the Search process is left simply to using pre-defined and subjective RDF or OWL tags, then the searcher may miss what he or she is searching for.  Being able to Search, understand and utilize all of the text on a Web page naturally results in higher relevancy and a more satisfying Search experience.

 

So, rather than use a structured, subjective and incomplete dictionary of tags, how about we use an already established, agreed upon and universal “tagging system” – the English language itself.  Semantic NLP uses English as the means of communications – a system of symbols, if you will, that has already been agreed upon by English speakers.  No new system of symbols needs to be created, calibrated, propagated and used.

 

Cognition’s Semantic NLP™ offers free-text, Semantic search in the body of documents and Web pages.  The advantages of searching semantically in free-text are:

 

1)    No labor to create tags as all of the content is indexed and used;

 

2)    Any information in a document can be searched for;

 

3)    Users don’t need to know the exact way a concept was expressed (Alternative ways of expressing concepts are recognized as meaning the same thing and relevant to the query.   For example, the pages in the RDF example above could be found with “cost” as well as “price”.); 

 

4)    Words are disambiguated within the context of how they are used, so “price” meaning “consequences of an action” would not be retrieved in response to a query about “price” meaning “monetary cost”.

 

For categorized (tagged) content already in place, Cognition’s Semantic NLP can be used in conjunction with tags to create a versatile free-text Search and a structured data Search.  This can be seen using the Advanced Search on Cognition’s SemanticMEDLINE™ Website (www.SemanticMEDLINE.com), where users can search by author, date and journal of abstracts along with free concept Search. 

 

With Microsoft’s acquisition of Powerset, Yahoo!’s decision to open its platform to RDF (tagging) and micro-formats, and Hakia’s ontological (categorization) vision, the Semantic Web is becoming more of a reality.  Cognition’s Semantic NLP, which is built on a vast and complete Semantic Map of the English language, has done the majority of the work that RDF and OWL are intended to do: render the content on a Web page semantically retrievable. 

 

(NOTE: You can also find this article at AltSearchEngines.com here.)

InfoToday.com Covers SemanticMEDLINE.com Launch

July 28th, 2008

With traditional keyword search engines, such as those used by Google, Yahoo!, and others, finding the best medical research document within complex data sets, such as MEDLINE, is very difficult without the use of complex Boolean equations and a deep understanding of the many permutations of technical synonymy. Cognition’s SemanticMEDLINE has the ability to target and locate these types of data that are otherwise hidden in masses of information because of its comprehensive Semantic Map (particularly deep within the health sciences discipline) and its unique ability to “understand” the meaning behind words, phrases, and idioms.

Read the full article here.

Award-Winning Merrill Corporation Enhances Litigation Case Management Solution

July 24th, 2008

Cognition Technologies is talked about in Merrill’s latest press release about their product:

To improve early identification and review of key evidence, version 5.6 also features conceptual search tools, offered via a licensing agreement with Cognition Technologies, Inc. Unlike keyword searches, conceptual searching interprets the meaning of words within the context they are used to help computers “understand” concepts, significantly increasing the precision of the discovery results. “Cognition’s Semantic NLP program is revolutionary in the methodology used to conduct complex searching through algorithms. The semantic natural language search component allows our clients to conduct more meaningful content searches and uncover relevant evidence more quickly.” said Harold Leach, senior vice president of IT for Merrill’s Electronic Discovery Services.” 

Read full release here.

Cognition Launches Semantic MEDLINE

July 23rd, 2008

LOS ANGELES – July 23, 2008 —Cognition Technologies, a next-generation Semantic Natural Language Processing (NLP) company, announces a quantum improvement in the application of NLP technology with the introduction of Semantic MEDLINE™ – the 18 million article abstract database of complex health information published by the National Library of Medicine. This new free service at www.SemanticMEDLINE.com enables complex health and life science material to be rapidly and efficiently discovered with greater precision and completeness. This marks the first time that users can employ a natural, conversational sentence structure to find the most complex studies within the MEDLINE dataset.

Read the full release here.

NY Times Blogs About Semantics and We Join Conversation

July 22nd, 2008

Saul Hansell wrote an interesting post on the NY Times blog entitled “Google and the Real Search for Meaning on the Web.”

We joined the conversation with this comment (link) because people still don’t understand how critical a vast and complete Semantic Map of the English language is to true understanding of text:

This is a great discussion! However, what we don’t see anyone talking about is what we believe is the key to making Natural Language Processing successful – the Semantic Map.

To advance NLP beyond its current limitations is a very difficult problem because you have to unravel the complexity of language. In particular, individual words have multiple meanings, and at the same time, a given concept or meaning can be referred to by multiple words. To understand which meaning of a word is correct requires an understanding of context, but in order to fully understand context requires that words and meanings be analyzed or “curated” one at a time into a vast Semantic Map.

Cognition’s Semantic Map of the English language, which has been built over the past 23 years, is complete, robust and unique, and represents the key market differentiator for the company and its technology.

Utilizing a Semantic Map differs from Google’s algorithms in that Google relies primarily on string pattern-matching, and does not try to interpret the meaning of the words within context. With a Semantic Map, it is possible to determine word meanings and therefore understand meanings rather than simple string pattern recognition. For example, in Wikipedia, in response to a query “strike oil in California”, Google returns documents about “strike commander” (a flight simulator), striking workers, hunger strikes, etc., in their top 5 results. With a Semantic Map, the most precise and complete retrievals about discovering oil in California are returned, because the meaning of “strike” in context is interpreted as “discover”, as opposed to “attack militarily” or “walk out”.

Cognition Technologies has changed the NLP paradigm through its unique and complete combination of a complete Semantic Map and linguistic elements to optimize semantic understanding:

Morphology

• The various forms of word, e.g. singular, plural, tense

Syntax

• The grammatical structure, e.g. verbs, nouns

Semantics

• Word and sentence meaning, augmented by synonymy and taxonomy

Spelling

• The various ways words are spelled (or misspelled)

We encourage you to try it for yourself at www.cognition.com.

Alt Search Engines Covers Cognition’s Funding

July 16th, 2008

Charles Knight has the story:

Cognition has completed an institutional and individual financing round totaling $2.7 million.  Investors include Draper Associates  Tim Draper), Fingerhut Ventures, and a personal investment by the company’s  CEO, Scott Jarus.

Link [AltSearchEngines.com]