22 December 2011

Windows 8 - A Vani, III B.Tech IT 'B'

Windows 8:

Windows, being one of the most used OS, brings great challenges for
Microsoft to keep its users impressed in the next version of Windows.
Although Windows 7 has done exceptionally well since its release,
Microsoft has been constantly working to make the next version of
Windows a better one by further improving upon the existing features,
and have added new impressive Windows 8 features along with a stunning
UI.

Windows 8 is the codename for the next version of the Microsoft
Windows computer operating system following Windows 7. It has many
changes from previous versions. In particular it adds support for ARM
microprocessors in addition to the previously supported x86
microprocessors from Intel and AMD. A new Start Screen interface has
been added that was designed for touchscreen input in addition to
mouse, keyboard, and pen input.
Windows 8 is a Part of the Microsoft Windows family, which is being
developed by Microsoft Corporation,with the kernel type of Hybrid,
licensed by Proprietarycommercial software. Platform supported are
IA-32, x86-64, and ARM.
The main feature that was shown was the new user interface.

Early announcements
In January 2011, at the Consumer Electronics Show (CES), Microsoft
announced that Windows 8 would be adding support for ARM
microprocessors in addition to the x86 microprocessors from Intel and
AMD.

Build conference and developer preview
Microsoft unveiled new Windows 8 features and improvements on
September 13, 2011, day one of the BUILD developer conference.
Microsoft also released a Developer Preview build (Build 8102) of
Windows 8 for the developer community to download and start working
with. According to Microsoft, there were more than 500,000 downloads
of the Windows 8 Developer Preview within the first 12 hours of its
release. Microsoft has shown a development roadmap at the BUILD
conference stating that they will develop Windows 8 using the same
milestones as Windows 7 did.
Hardware requirements:
Minimum hardware requirements for Windows Developer Preview:

Architecture IA-32
x86-64
ARM

Processor
1 GHz TBA
Memory (RAM)
1 GB 2 GB TBA
Graphics Card
DirectX 9graphics device with WDDM1.0 or higher driver
TBA
HDDfree space
16 GB 20 GB TBA

A multi-touch screen is required to use touch input. For Metro
applications, a screen resolution of 1024x768 or higher is required.
Microsoft may also require new PCs to have the UEFI secure boot
feature enabled by default to be given Windows 8 certification. There
has been some concern that it could lead to machines that do not
support alternative operating systems. The manufacturer is free to
choose which signatures are accepted by the feature and to offer the
ability to turn off the secure boot feature.
Microsoft has revealed the following maximum supported hardware
specifications for Windows Server 8 at the BUILD conference.
Logical processors
640 (was 256 in Windows Server 2008 R2)
Random-access memory
4 TB (was 2 TB in Windows Server 2008 R2)
Failover cluster nodes 63 (was 16 in Windows Server 2008 R2)
Windows 8 Minimum System Requirements
It is reported that Windows 8 is expected to have the same system
requirements as Windows 7, and will run on existing PCs/laptops that
run Windows 7. Here are the following minimum system requirements to
run Windows 8:
Processor: 1GHz or faster 32-bit or 64-bit processor
RAM: 1GB (32-bit) or 2GB (64-bit)
HD Space: 16GB for 32-bit (or 20GB for 64-bit)

Compatibility:
Windows 8 for x86 processors will run much software compatible with
previous x86 versions of Windows, with the usual restrictions: 64-bit
Windows will run also 32-bit software but not 16-bit ones; 32-bit
Windows will optionally run 16-bit software if installed to do so, but
will not run 64-bit software. Either 32- or 64-bit Windows can be
installed on x86-64 processors. Some expertise in manipulating
compatibility settings may be required to run, for example, 16-bit
software for Windows 3.x under 32-bit Windows 8, in cases where it is
possible. In particular, applications compatible with 32- and 64-bit
Windows 7 will run in the same way on Windows 8.
Windows 8 for ARM processors will not run software created for x86;
software will have to be ported by its developers to create ARM
executables from source code.

Windows 8 Features:
Windows 8 offers a number of new Windows 8 features from its all new
UI to Windows App Store and improvised security features. Here's a
list of new features and improvements which will arrive with Windows
8:
• Windows 8 will contain a new user interface based on Microsoft's
design language named Metro. With the new change, the Start Menu was
replaced in favor for the new Start Screen, where there are tiles that
contain shortcuts to applications, Metro style applications, and
updating tiles, similar to Windows Phone.
• A new authentication method allows users to sketch in three
different places over the picture to login, instead of typing a
password.
• Windows Explorer now uses a ribbon interface, similar to those used
in Microsoft Office applications.
• Another feature expected to be introduced in Windows 8 is native USB
3.0 support, without the need to load drivers.
• Windows 8 will come with Windows Store, an online marketplace for
buying, selling, and advertising applications.
• Windows 8 can be run from a USB-connected drive, such as a flash
drive. This feature is called Windows To Go. It is intended for
enterprise administrators to provide users with a Windows 8 image that
reflects the corporate desktop.
• Windows 8 will support multiple monitors with the new ability to
natively display different background images on each display and
customized taskbar(s) on each of the connected displays.
• The Developer Preview comes with two new recovery functions. Refresh
and Reset, which both make a complete restore easier than a
re-installation. The former keeps all the settings and files of the
user intact and only reverses all changes to Windows files to its
original state while removing all installed programs and apps. The
latter deletes all files and effectively re-installs Windows, but
without any additional user input such as agreeing to license
agreements or selecting a hard disk required. After a reset completes,
the user will be asked for the product key and will then proceed to
account creation.
• One big change is that user accounts do not have to be local-only
(or from an Active Directory domain) anymore but can be linked up to
one's Windows Live ID. This has the advantage that users will not lose
their settings and files as they move from their home computer to
their work laptop or to any other computer also using Windows 8.
• Other new features include a new Welcome screen, a new packaged
application model called AppX that is based on Silverlight, and Open
Packaging Conventions, as well as a setting to automatically adjust
window color to fit the wallpaper.
• There is also a stripped down "Immersive" version of Internet
Explorer, using the similar Metro-based user interface of the mobile
version of Internet Explorer 9. The Immersive Version of Internet
Explorer 10 does not support ActiveX plugins, in order to be an
HTML5-only browser. The Desktop version of IE10 does support ActiveX
plugins.
• A new "Hybrid Boot" option that uses "advanced hibernation
functionality" on shutdown to allow faster startup times.
• A new version of Task Manager with a redesigned user interface is
also present in pre-release versions of Windows 8.

Support for both x86 PCs and ARM tablets
Windows 8 is the first edition of Windows which will work on both ARM
based tablets and traditional x86 (as well as x32 and x64) PCs based
on ARM processors from Intel and AMD.
"Support for ARM-based chipsets, touch, and sensors makes Windows 8
work beautifully on your choice of a full spectrum of devices, such as
10-inch slates with all-day battery life, ultra-lightweight laptops,
and powerful all-in-ones with 27-inch high-definition screens,".

Windows To Go
The "Windows To Go" seems to be an exciting feature that basically
allows Windows 8 to boot from a USB device (called as Live USB),
including user's programs, settings and files. The feature is designed
to work with both USB 2.0 and USB 3.0, and on both legacy NIOS and
UDFI firmware. However, Microsoft says that a user will not be able to
"Hibernate" with this feature.
"Windows To Go is a new feature in Windows 8 that enables enterprise
administrators to create USB drives containing complete, managed
Windows images that users can use to boot and run Windows on any
Windows 7 or Windows 8 capable computer. Windows To Go makes it
possible for employees to use a managed device whether they work from
home, a client office or in a free seating environment. This session
will discuss Windows To Go, its hardware requirements and building
compatible and complementary software."
Another interesting thing about this feature is that, if the USB drive
is removed, Windows will automatically freeze, but will continue to
operate if the USB drive is inserted back in the next 60 seconds after
removal.

Windows Store
To compete with Apple, Windows has confirmed the introduction of a
Windows Store, similar to Mac App Store, which allows users to browse
through Windows applications, while developers can publish their
Metro-style apps on Windows 8 devices.

Windows 8 User Interface
Speaking about the UI, Windows 8 certainly has got a mind blowing
interactive UI, which has been extensively redesigned to a
"Metro-style" design, which shows the most important information to
you, embodies simplicity, and gives you full control over it. The UI
is designed to provide a fluid and intuitive interaction and
navigation with touch, or a mouse and keyboard.

Improved Windows Defender
The Windows Defender which was first released with Windows Vista is
getting a security upgrade with the upcoming Windows 8. This would
probably edge-our third-party antivirus and anti-malware programs.
Windows Defender will become a full-fledged anti-malware solution for
Windows 8. This will include protection against all types of malware
such as virus, worms, trojans and rootkits.
In addition to that, Windows Defender will provide real-time
detection and protection from malware threats using a file system
filter, and will interface with Windows secured boot, which is another
security feature.

Internet Explorer 10
The developer preview of Windows 8 comes with a Metro version of
Internet Explore (IE) 10. Since it is a preview version, it does not
support any browser plugins and extension – including Adobe Flash and
Microsoft Silverlight. However, it will feature full screen browsing
mode by default, implements several new HTML5 capabilities,
turbocharges HTML5 performance, and adds some security options for Web
applications.

Windows 8 Release Date
We can expect Windows 8 to be launched sometime in mid-late 2012. The
only question that haunts each and every one of us – Will Windows 8
win the battle against Apple which it had lost several years back?
Let's see to that.

HTML5 - C. Sathiya Priya, III B.Tech IT 'A'

HTML5

HTML5 is a language for structuring and presenting content for the
World Wide Web, and is a core technology of the Internet originally
proposed by Opera Software. It is the fifth revision of the HTML
standard (created in 1990 and standardized as HTML4 as of 1997) and as
of November 2011 is still under development. Its core aims have been
to improve the language with support for the latest multimedia while
keeping it easily readable by humans and consistently understood by
computers and devices (web browsers, parsers, etc.). HTML5 is intended
to subsume not only HTML 4, but XHTML 1 and DOM2HTML (particularly
JavaScript) as well.

Following its immediate predecessors HTML 4.01 and XHTML
1.1, HTML5 is a response to the observation that the HTML and XHTML in
common use on the World Wide Web are a mixture of features introduced
by various specifications, along with those introduced by software
products such as web browsers, those established by common practice,
and the many syntax errors in existing web documents. It is also an
attempt to define a single markup language that can be written in
either HTML or XHTML syntax. It includes detailed processing models to
encourage more interoperable implementations; it extends, improves and
rationalises the markup available for documents, and introduces markup
and application programming interfaces (APIs) for complex web
applications. For the same reasons, HTML5 is also a potential
candidate for cross-platform mobile applications. Many features of
HTML5 have been built with the consideration of being able to run on
low-powered devices such as smartphones and tablets.
In December 2011 research firm Strategy Analytics forecast
sales of HTML5 compatible phones will top 1 billion in 2013. In
particular, HTML5 adds many new syntactical features. These include
the <video>, <audio>, <header> and <canvas> elements, as well as the
integration of SVG content that replaces the uses of generic <object>
tags. These features are designed to make it easy to include and
handle multimedia and graphical content on the web without having to
resort to proprietary plugins and APIs. Other new elements, such as
<section>, <article>, <header> and <nav>, are designed to enrich the
semantic content of documents. New attributes have been introduced for
the same purpose, while some elements and attributes have been
removed. Some elements, such as <a>, <cite> and <menu> have been
changed, redefined or standardized. The APIs and document object model
(DOM) are no longer afterthoughts, but are fundamental parts of the
HTML5 specification. HTML5 also defines in some detail the required
processing for invalid documents so that syntax errors will be treated
uniformly by all conforming browsers and other user agents.

History

The Web Hypertext Application Technology Working Group
(WHATWG) began work on the new standard in 2004, when the World Wide
Web Consortium (W3C) was focusing future developments on XHTML 2.0,
and HTML 4.01 had not been updated since 2000. In 2009, the W3C
allowed the XHTML 2.0 Working Group's charter to expire and decided
not to renew it. W3C and WHATWG are currently working together on the
development of HTML5.
Even though HTML5 has been well known among web developers
for years, it became the topic of mainstream media in April 2010 after
Apple Inc's then-CEO Steve Jobs issued a public letter titled
"Thoughts on Flash" where he concludes that Adobe "Flash is no longer
necessary to watch video or consume any kind of web content" and that
"new open standards created in the mobile era, such as HTML5, will
win". This sparked a debate in web development circles where some
suggested that while HTML5 provides enhanced functionality, developers
must consider the varying browser support of the different parts of
the standard as well as other functionality differences between HTML5
and Flash. In early November 2011 Adobe announced that it will
discontinue development of Flash for mobile devices and reorient its
efforts in developing tools utilizing HTML 5.

W3C standardization process

WHATWG started work on the specification in June 2004 under
the name Web Applications 1.0. As of January 2011, the specification
is in the Draft Standard state at the WHATWG, and in Working Draft
state at the W3C. Ian Hickson of Google is the editor of HTML5.
The HTML5 specification was adopted as the starting point of
the work of the new HTML working group of the World Wide Web
Consortium (W3C) in 2007. This working group published the First
Public Working Draft of the specification on 22 January 2008. The
specification is an ongoing work, and is expected to remain so for
many years, although parts of HTML5 are going to be finished and
implemented in browsers before the whole specification reaches final
Recommendation status.
According to the W3C timetable, it was estimated that HTML5
would reach W3C Recommendation by late 2010. However, the First Public
Working Draft estimate was missed by eight months, and Last Call and
Candidate Recommendation were expected to be reached in 2008, but as
of January 2011 HTML5 was still at Working Draft stage in the W3C.
HTML5 has been at Last Call in the WHATWG since October 2009.
Ian Hickson, editor of the HTML5 specification, expects the
specification to reach the Candidate Recommendation stage during 2012.
The criterion for the specification becoming a W3C Recommendation is
"two 100% complete and fully interoperable implementations". In an
interview with TechRepublic, Hickson guessed that this would occur in
the year 2022 or later. However, many parts of the specification are
stable and may be implemented in products:
In December 2009, WHATWG switched to an unversioned development
model for the HTML5 specification. W3C will still continue with
publishing a snapshot of the HTML5 specification.
On 14 February 2011, the W3C extended the charter of its HTML
Working Group with clear milestones for HTML5. The Working Group is
expected to advance HTML5 to "Last Call", an invitation to communities
inside and outside W3C to confirm the technical soundness of the
specification, in May 2011. The group will then shift focus to
gathering implementation experience. W3C is also developing a
comprehensive test suite to achieve broad interoperability for the
full specification by 2014, which is now the target date for
Recommendation.

Even as innovation continues, advancing HTML5 to Recommendation
provides the entire Web ecosystem with a stable, tested, interoperable
standard. The decision to schedule the HTML5 Last Call for May 2011
was an important step in setting industry expectations. Today we take
the next step, announcing 2014 as the target for Recommendation.
— Jeff Jaffe, Chief Executive Officer,

Markup:
HTML5 introduces a number of new elements and attributes
that reflect typical usage on modern websites. Some of them are
semantic replacements for common uses of generic block (<div>) and
inline (<span>) elements, for example <nav> (website navigation
block), <footer> (usually referring to bottom of web page or to last
lines of HTML code), or <audio> and <video> instead of <object>. Some
deprecated elements from HTML 4.01 have been dropped, including purely
presentational elements such as <font> and <center>, whose effects are
achieved using Cascading Style Sheets. There is also a renewed
emphasis on the importance of DOM scripting (e.g., JavaScript) in Web
behavior.
The HTML5 syntax is no longer based on SGML despite the
similarity of its markup. It has, however, been designed to be
backward compatible with common parsing of older versions of HTML. It
comes with a new introductory line that looks like an SGML document
type declaration, <!DOCTYPE html>, which triggers the
standards-compliant rendering mode. As of 5 January 2009, HTML5 also
includes Web Forms 2.0, a previously separate WHATWG specification.
New APIs
In addition to specifying markup, HTML5 specifies scripting
application programming interfaces (APIs). Existing document object
model (DOM) interfaces are extended and de facto features documented.
There are also new APIs, such as:

• The canvas element for immediate mode2D drawing. See Canvas 2D API
Specification 1.0 specification
• Timed media playback
• Offline storage database(offline web applications) Document editing
• Drag-and-drop
• Browser history management
• MIME typeand protocol handler registration
• Microdata


Not all of the above technologies are included in the W3C
HTML5 specification, though they are in the WHATWG HTML specification.
Some related technologies, which are not part of either the W3C HTML5
or the WHATWG HTML specification, are as follows. The W3C publishes
specifications for these separately.
• Geolocation
• Web SQL Database, a local SQL Database (no longer maintained).
• The Indexed Database API, an indexed hierarchical key-value store
(formerly WebSimpleDB).
• Web Storage, a key-value pair storage framework that provides
enhanced behaviour similar to Cookiesbut with larger storage capacity
and improved API. • File API, Handle file uploads and file
manipulation.
• Directories and System. This API is intended to satisfy
client-side-storage use cases not well served by databases.
• File Writer. An API for writing to files from web applications.

A common misconception is that HTML5 can provide
animation within web pages, which is untrue. Either JavaScript or CSS3
is necessary for animating HTML elements. Animation is also possible
using JavaScript and HTML 4, and within SVG elements through SMIL,
although browser support of the latter remains uneven as of 2011.

XHTML5
XHTML5 is the XML serialization of HTML5. XML documents
must be served with an XML Internet media type such as
application/xhtml+xml or application/xml. XHTML5 requires XML's
strict, well-formed syntax. The choice between HTML5 and XHTML5 boils
down to the choice of a MIME/content type: the media type one chooses
determines what type of document should be used. In XHTML5 the HTML5
doctype html is optional and may simply be omitted. HTML that has been
written to conform to both the HTML and XHTML specifications—and which
will therefore produce the same DOM tree whether parsed as HTML or
XML—is termed "polyglot markup".

Error handling
An HTML5 (text/html) browser will be flexible in handling
incorrect syntax. HTML5 is designed so that old browsers can safely
ignore new HTML5 constructs. In contrast to HTML 4.01, the HTML5
specification gives detailed rules for lexing and parsing, with the
intent that different compliant browsers will produce the same result
in the case of incorrect syntax. Although HTML5 now defines a
consistent behavior for "tag soup" documents, those documents are not
regarded as conforming to the HTML5 standard.

Popularity
According to a report released on 30 September 2011, 34
of the world's top 100 Web sites were using HTML5 – the adaptation led
by search engines and social networks.

Differences from HTML 4.01 and XHTML 1.x
The following is a cursory list of differences and some
specific examples.
• New parsingrules: oriented towards flexible parsing and
compatibility; not based on SGML
• Ability to use inline SVGand MathMLin text/html
• New elements: article, aside, audio, bdo, canvas, command, datalist,
details, embed, figcaption, figure, footer, header, hgroup, keygen,
mark, meter, nav, output, progress, rp, rt, ruby, section, source,
summary, time, video, wbr
• New types of form controls: dates and times, email, url, search,
number, range, tel, color
• New attributes: charset(on meta), async(on script)
• Global attributes (that can be applied for every element): id,
tabindex, hidden, data-(custom data attributes)
• Deprecated elements will be dropped altogether: acronym, applet,
basefont, big, center, dir, font, frame, frameset, isindex, noframes,
strike, tt

The HTML5 logo
On 18 January 2011, the W3C introduced a logo to represent
the use of or interest in HTML5. Unlike other badges previously issued
by the W3C, it does not imply validity or conformance to a certain
standard. As of 1 April 2011, this logo is official.hen initially
presenting it to the public, the W3C announced the HTML5 logo as a
"general-purpose visual identity for a broad set of open web
technologies, including HTML5, CSS, SVG, WOFF, and others". Some web
standard advocates, including The Web Standards Project, criticised
that definition of "HTML5" as an umbrella term, pointing out the
blurring of terminology and the potential for miscommunication. Three
days later, the W3C responded to community feedback and changed the
logo's definition, dropping the enumeration of related technologies.
The W3C then said the logo "represents HTML5, the cornerstone for
modern Web applications".

iPhone 4S-The most amazing iPhone yet - Uma Mohan, II B.Tech IT 'B'

iPhone 4S-The most amazing iPhone yet

Apple unveiled their new iPhone 4S on October 4 at the Apple's "Let's
talk iPhone" event at Cupertino. Here's some of the information that
we need to know about the device iPhone 4s. Steve Jobs described the
phone as, "beyond a doubt, the most precise thing and one of the most
beautiful things we've ever made" before highlighting a number of its
new features. The device runs iOS 5 by default.

Bands (CDMA & GSM), Hardware, WiFi and battery life:

It is now a world phone, where you don't need separate device to use
CDMA or GSM. The new internal antenna system has been rethought and
now can access both GSM and CDMA signals. The antenna gate problem is
also sorted out. The iPhone 4S's hardware is now improved. It now
sports an A5 processor, the processor that runs on iPad 2 with an
amazing speed. The device also has a new GPU that enhances the gaming
experience. A demo of the Unreal Engine's Infinity Blade 2 was made
which showcased graphics in pair with a high-end computer gaming. The
new iPhone 4S sports an advanced WiFi antenna system. The phone can
now download at speeds of 14.4 mbps, which marks the highest in the
segment when it comes to WiFi download speeds from its competitors.
The iPhone 4S is now powered by a powerful battery, and is capable of
providing 8 hours of talk time on 3G, 14 hours of talk time on 2G, 300
hours of standby time, 6 hours of internet usage in 3G and 9 hours of
internet usage in WiFi.The iPhone is also capable of playing up to 10
hours of video and more than 40 hours of audio. This is incredible.

Camera and screen resolution:

The iPhone 4S's camera has been buffed up and the device now have an 8
mega pixel camera built into it. As usual, the device is capable of
shooting HD videos at 1080p.The camera's resolution uses the actual
pixels to shoot a photo. The picture resolution is 3264 x 2448, which
is really huge and the clarity is just amazing when seen on the phone
as well as on the computer. The phone uses a new Hybrid IR filter
which allows users to take brilliant shots even in low-light
situations. The phone also will have a improved screen resolution: 326
pixels per inch with heightened contrast. Jobs told that the 300
pixels per inch is the limit of the ability of the human retina to
view images. The camera application now comes with an in-built Face
Detection system that would recognize the faces of people
automatically.

iOS 5:

The iOS 5 is definitely one of the most stable and advanced mobile
operating system on the planet. With more than 200 feature
improvements from its previous version of iOS, the software is cool,
stable and is efficient in handling multi-tasking. iOS 5 is a
tremendous upgrade for the iPod touch, and so is iCloud.

Siri Assistant:

iOS 5 that comes with iPhone 4S is now equipped with Siri's voice
based assistant that would help you in getting information both from
your phone as well as from the internet with just your voice. The new
personal assistant has got a system-wide integration. The new feature
has been restricted only to the iPhone 4S. we can get a list of
examples of what we can do, including meetings, emails, directions,
weather, stocks, looking up contacts, creating notes.

How Siri Works?

Based on Applidium's reverse engineering, when you talk to your iPhone
4S' Siri, our voice is recorded and stored in a temporary location.
The audio is then compressed on the device with Speex, and is
transmitted to the servers for processing using a special and
encrypted HTTP request. After the servers process the speech, the
server sends back a zlib compressed package that contains the response
for your question. Every request that goes into the Apple's Servers is
SSL-encrypted and is highly secure. The technology uses a custom HTTP
request called ACE. This has a high Content-Length value and also a
special string that would let the server identify it as a assistant.
Adding to the amount of security work that Apple has done on Siri and
its servers, the packets also has a special header that sends the UDID
of the device. Not the regular UDID you see on iTunes. The servers
would process the request ONLY if it thinks that the UDID's are
correct and are valid and are coming from the actual iPhone 4S. But,
if this could be replicated, then it would be not so difficult to port
Siri onto other devices.

BIOMEDICAL TEXT MINING - Sakthi Nathiarasan, III B.Tech IT 'B'

BIOMEDICAL TEXT MINING

Text mining, sometimes alternately referred to as text data mining, roughly equivalent to text analytics, refers to the process of deriving high-quality information from text.  High-quality information is typically derived through the devising of patterns and trends through means such as statistical pattern learning. Text mining usually involves the process of structuring the input text (usually parsing, along with the addition of some derived linguistic features and the removal of others, and subsequent insertion into a database), deriving patterns within the structured data, and finally evaluation and interpretation of the output. 'High quality' in text mining usually refers to some combination of relevance, novelty, and interestingness. Typical text mining tasks include text categorization, text clustering, concept/entity extraction, production of granular taxonomies, sentiment analysis, document summarization, and entity relation modeling (i.e., learning relations between named entities

Text mining uses the process of "stemming". Words that have similar root stems are considered the same. Therefore similar items can be grouped using text mining to reduce the number of categories.

Biomedical text mining (also known as BioNLP) refers to text mining applied to texts and literature of the biomedical and molecular biology domain. It is a rather recent research field on the edge of natural language processing, bioinformatics, medical informatics and computational linguistics.

There is an increasing interest in text mining and information extraction strategies applied to the biomedical and molecular biology literature due to the increasing number of electronically available publications stored in databases such as PubMed.

.some tools used for biomedical data mining are listed below:

  KLEIO - an advanced information retrieval system providing knowledge enriched searching for biomedicine.

  FACTA+ - a MEDLINE search engine for finding associations between biomedical concepts. The FACTA+ Visualizer helps intuitive understanding of FACTA+ search results through graphical visualization of the results

  U-Compare - U-Compare is an integrated text mining/natural language processing system based on the UIMA Framework, with an emphasis on components for biomedical text mining.

 TerMine - a term management system that identifies key terms in biomedical and other text types.

 MEDIE - an intelligent search engine to retrieve biomedical correlations from MEDLINE, based on indexing by Natural Language Processing and Text Mining techniques

 AcroMine - an acronym dictionary which can be used to find distinct expanded forms of acronyms from MEDLINE

 AcroMine Disambiguator - Disambiguates abbreviations in biomedical text with their correct full forms.

 GENIA tagger - Analyses biomedical text and outputs base forms, part-of-speech tags, chunk tags, and named entity tags

 NEMine - Recognizes gene/protein names in text

 Yeast MetaboliNER - Recognizes yeast metabolite names in text.

Smart Dictionary Lookup - machine learning-based gene/protein name lookup.

Chilibot — A tool for finding relationships between genes or gene products.

CLUSTERING ALGORITHM FOR WEB APPLICATION BASED ON TEXT MINING - Sakthi Nathiarasan III B.Tech IT 'B'

CLUSTERING ALGORITHM FOR WEB APPLICATION BASED ON TEXT MINING

 

Text mining, sometimes alternately referred to as text data mining, roughly equivalent to text analytics, refers to the process of deriving high-quality information from text. High-quality information is typically derived through the devising of patterns and trends through means such as statistical pattern learning

There are a number of well-known java-based open-source text mining applications. APIs inherent within the text mining research and communities. Of perhaps greatest importance are KEA (developed by the same engineering group as WEKA data mining tool developed) and Carrot2

KEA versus Carrot2

KEA stands for Keyphrase Extraction Algorithm and, as its name suggests, is used to extract keyphrases and words from text. It is a verbose system which allows for both controlled indexing and free indexing. The forma is concerned with extracting keyphrases from documents with reference to a controlled vocabulary providing for a more concept-based approach to extraction whilst the latter does not consider different term meaning e.g. lap-top will not be considered the same as note-book. These are natural language processing anomolies. KEA does not undertake clustering, but instead employs a supervised learning approach and builds a Baive Bayes learning model through a set of training documents with known keyphrases. For the purposes of training, phrase "term-frequency-inverse-document-frequency" (TF-IDF) weight as well as position within the document are considered. (El-Beltagy, 2006)

 

Carrot2 has been introduced previously. This scheme is very different to KEA, as it takes-on an unsupervised approach to text mining and phrase extraction. In terms of implementation however, there are many similarities in the initial stages of text mining. Consider the following:

 

The above diagram illustrates a summary of the information retrieval and document preprocessing implementation of text mining employed by both KEA and Carrot2 (however, with Carrot2 specifically geared for web search results).

 Carrot2 explicitly uses Apache Lucene 2.2 (adjusted for web search resutls) to facilitate the above processes. Each broadly undertakes each of -> points depicted in the central box, except for the marked [] which relates to the KEA controlled indexing approach to text mining. The other marking within this box * ear-marks variation between the two text mining schemes and well as signing options within the individual schemes. Each is now considered:

1. Stemming: there are a number of implicit options available to both schemes with regards to stemming-

a. Lovins Stemmer: the most aggressive stemmer, with some 294 nested rules;

b. Porter Stemmer: this more subdued approach has some 37 rules;

c. Partial Porter Stemmer: the Porter Stemmer has 5 multi-part stages which allow the "miner" to be more or be less conservative in their stemming process. The first stage Porter stemmer is a popular methodology, handling basic plurals e.g. horses becomes horse, processes becomes process, men does not become man; and

d. No Stemmer.

2. Stop words: KEA defines some 499 stop-words, Carrot2 some 324 stop-words (adjusted lucene), lucene only some 33 and the Brown Corpus some 425 (Sharma et Raman, 2003).

3. Vocabularies: This is only relevant to KEA. The "miner" is able to dicate a dictionary, thesaurus or list of terms when undertaking controlled indexing. Also, in terms of implementation, these can be in either text form or resourse description format ("rdf"). A number of popular vocabularies are in circulation, including the integrated public service vocabulary ("ipsv") and the agrovoc vocabulary.

How do these systems relate to my thesis?

Implementation of clustering: LSA/SVD & description comes first - replacing KEA's NaiveBayes classifier with Carrot2's lingo algorithm. These frameworks are both java based and open-source.

13 December 2011

Bump Top - M. Abirami, III B.Tech IT 'A'


BumpTop

 In computing, BumpTop is a desktop environment that simulates the normal behavior and physical properties of a real world desk, and enhances it with automatic tools to organize its contents. It is aimed at stylus interaction, making it more suitable for tablet computers and handheld PCs. It was created at the University of Toronto as Anand Agarawala's Masters Thesis. Anand Agarawala also gave a presentation at the TED conference about his idea. The 1.0 version was released on April 8, 2009, along with a fully featured pro version as a paid upgrade. On April 30, 2010 the author announced that BumpTop was being discontinued and that they were taking the software "...in an exciting new direction..." Two days later on May 2, it was announced that the company had been acquired by Google. On January 5, 2011 Google released a "sneak peek" video of Android 3.0 Honeycomb showing a 3D desktop with features purportedly taken from BumpTop.

In BumpTop, documents are described by three-dimensional boxes lying on a virtual desk. The user can position the boxes on the desk using the stylus or mouse. Extensive use of physics effects like bumping and tossing is applied to documents when they interact, for a more realistic experience. Boxes can be stacked with well-defined gestures. Multiple selection is performed by means of a LassoMenu, which fluidly combines into one stroke the act of lasso selection and action invocation via pie menus. BumpTop currently supports Windows XP/Vista/7, and a version for Mac OS X was released into private beta on January 18, 2010. The Mac edition omits the pie menu in favor of a more normal selection menu.

The software installer and the application phone home. While the download page on the official website does state: "Internet connection required for activation", this may not appear in other sources such as Cnet. There is no explicit dialog box asking the user to confirm this connection at the time it is required. BumpTop automatically updates to the latest version.

Multi-Touch:

With the release of BumpTop 1.2 on October 9, 2009, multi-touch support was added for Windows 7. It added 14 new gestures to the system that used multiple touches on the screen. One such gesture is "scrunching" your hand to pull files into a pile. Just like the regular version of BumpTop, the extensive use of physics is applied to these multi-touch gestures. Multi-touch support has since been added to Mac OS X as well.

Multi-touch support is currently only available in the Pro version of the software.

BumpTop Inside

On August 18, 2009, BumpTop announced their new "BumpTop Inside" program. Partnering with HIS, PowerColor, and SAPPHIRE, a free copy of BumpTop will be included with their graphics cards. The reasoning behind the partnership was to allow BumpTop to be spread to more customers, as well as allowing BumpTop to use the power of the newly bought graphics card. "BumpTop creates a brand new user experience for computer desktops," said Ted Chen, CEO of TUL Corporation. "We are excited to add this innovative application into our graphics solution. Backed by the power of PowerColor graphics, BumpTop will make the user experience more vivid and change the way the traditional computer desktop is used."

Reception

BumpTop has been generally well received by critics, with many of them excited about the possibility of BumpTop's features becoming standard in GUIs. CNET Editor Seth Rosenblatt gave it 5 stars, Thanks to the added multi-touch support, Engadget says that "Bumptop gives Windows 7 touchscreen PCs purpose." CrunchGear simply says "BumpTop: A Better Windows desktop." On BumpTop's website, they have 23 quotes of positive reviews from professional editors.

"This program is a real looker. When I showed it to friends, they immediately wanted to play with it, dragging pictures to pin on virtual walls and drawing circles around several icons at once to stack them into piles like magic. BumpTop is a pleasure to use and for no cost, the free version is certainly worth a try. If you like what you see and you're willing to change the way you use your desktop, the Pro version might be worthwhile."

Acquisition by Google

In April 2010 it was announced that Bumptop had been acquired by Google. It is unknown what Google has planned for the software, but speculation has begun it plans to revamp it for a new Android based tablet UI. Shortly after its acquisition, Bumptop announced plans to remove the software completely from its website, only giving "End of Life" support to those who bought the Pro version.

Wearable Gestural Interface - M. Abirami, III B.Tech IT 'A'

WEARABLE GESTURAL INTERFACE

SIXTH SENSE:

 

Integrating information with the real world:

 

      'SixthSense' is a wearable gestural interface that augments the physical world around us with digital information and lets us use natural hand gestures to interact with that information.

     The SixthSense prototype comprises a pocket projector, a mirror and a camera contained in a pendant like, wearable device. Both the projector and the camera are connected to a mobile computing device in the user's pocket. The projector projects visual information enabling surfaces, walls and physical objects around us to be used as interfaces; while the camera recognizes and tracks user's hand gestures and physical objects using computer-vision based techniques. The software program processes the video stream data captured by the camera and tracks the locations of the colored markers (visual tracking fiducials) at the tip of the user's fingers. The movements and arrangements of these fiducials are interpreted into gestures that act as interaction instructions for the projected application interfaces. SixthSense supports multi-touch and multi-user interaction.
 

We've evolved over millions of years to sense the world around us. When we encounter something, someone or some place, we use our five natural senses to perceive information about it; that information helps us make decisions and chose the right actions to take. But arguably the most useful information that can help us make the right decision is not naturally perceivable with our five senses, namely the data, information and knowledge that mankind has accumulated about everything and which is increasingly all available online.

 

      Although the miniaturization of computing devices allows us to carry computers in our pockets, keeping us continually connected to the digital world, there is no link between our digital devices and our interactions with the physical world. Information is confined traditionally on paper or digitally on a screen. SixthSense bridges this gap, bringing intangible, digital information out into the tangible world, and allowing us to interact with this information via natural hand gestures. 'SixthSense' frees information from its confines by seamlessly integrating it with reality, and thus making the entire world your computer.

The SixthSense prototype is comprised of a pocket projector, a mirror and a camera. The hardware components are coupled in a pendant like mobile wearable device. Both the projector and the camera are connected to the mobile computing device in the user's pocket. The projector projects visual information enabling surfaces, walls and physical objects around us to be used as interfaces; while the camera recognizes and tracks user's hand gestures and physical objects using computer-vision based techniques. The software program processes the video stream data captured by the camera and tracks the locations of the colored markers (visual tracking fiducials) at the tip of the user's fingers using simple computer-vision techniques. The movements and arrangements of these fiducials are interpreted into gestures that act as interaction instructions for the projected application interfaces. The maximum number of tracked fingers is only constrained by the number of unique fiducials, thus SixthSense also supports multi-touch and multi-user interaction.

       The SixthSense prototype implements several applications that demonstrate the usefulness, viability and flexibility of the system. The map application lets the user navigate a map displayed on a nearby surface using hand gestures, similar to gestures supported by Multi-Touch based systems, letting the user zoom in, zoom out or pan using intuitive hand movements. The drawing application lets the user draw on any surface by tracking the fingertip movements of the user's index finger. SixthSense also recognizes user's freehand gestures (postures). For example, the SixthSense system implements a gestural camera that takes photos of the scene the user is looking at by detecting the 'framing' gesture. The user can stop by any surface or wall and flick through the photos he/she has taken. SixthSense also lets the user draw icons or symbols in the air using the movement of the index finger and recognizes those symbols as interaction instructions. For example, drawing a magnifying glass symbol takes the user to the map application or drawing an '@' symbol lets the user check his mail. The SixthSense system also augments physical objects the user is interacting with by projecting more information about these objects projected on them. For example, a newspaper can show live video news or dynamic information can be provided on a regular piece of paper. The gesture of drawing a circle on the user's wrist projects an analog watch.

The current prototype system costs approximate $350 to build.

The SixthSense prototype contains a number of demonstrations applications.

  • The map application lets the user navigate a map displayed on a nearby surface using hand gestures to zoom and pan
  • The drawing application lets the user draw on any surface by tracking the fingertip movements of the user's index finger.
  • SixthSense also implements Augmented reality; projecting information onto objects the user interacts with.

The system recognizes a user's freehand gestures as well as icons/symbols drawn in the air with the index finger, for example:

  • A 'framing' gesture takes a picture of the scene. The user can stop by any surface or wall and flick through the photos he/she has taken.
  • Drawing a magnifying glass symbol takes the user to the map application while an '@' symbol lets the user check his mail.
  • The gesture of drawing a circle on the user's wrist projects an analog watch.
  • we can play motion games via piece of paper and digital information about a product can be know instantaneous   

NEWSPAPAER SHOWING LIVE VIDEO NEWS

  

DRAWING A CIRCLE ON THE WRIST SHOWS TIME

 

BOARDING PASS SHOWING THE LIVE FLIGHT TIMES


12 December 2011

Device That Allows Using the Arm as a Touch screen - Muthu Visagan V, III B.Tech IT 'A'

Device That Allows Using the Arm as a Touch screen

Chris Harrison at Carnegie Mellon University and Dan Morris and Desney Tan at Microsoft's research lab in Redmond, Washington, recently came up with their latest invention called Skinput, which represents a skin-based interface that makes it possible for a person to use his or her palm as a touch screen. The Skin put can be used to play games, control various devices, make phone calls and surf the Internet. The invention features a keyboard, menu and a number of other graphics that appear of the user's palm and forearm. The graphics are generated by a pico projector that in incorporated in an armband. When the user touches a certain point on his or her palm, the acoustic detector in the armband identifies the part that was activated and performs the respective action. Scientists explain that the differences in bone density, size and mass, along with filtering effects from a person's soft tissues and joints, imply that various locations on the user's skin have different acoustic features. It is worth mentioning that the acoustic detector used in this invention is able to identify five skin locations, registering an accuracy of about 95.5 percent.

Using wireless technology, the researchers' latest invention can convey the signals to a cell phone, iPod or computer. The system was tested by 20 volunteers who gave a positive response to the device and its ability to provide fast navigation. Researchers look forward to present their latest invention in April at the Computer-Human Interaction conference which will take place in Atlanta, Georgia.

Finding the keypad on your cell phone or music player a bit cramped? Maybe your forearm could be more accommodating. It could become part of a skin-based interface that effectively turns your body into a touch screen.

Called Skin put, the system is a marriage of two technologies: the ability to detect the ultralow-frequency sound produced by tapping the skin with a finger, and the microchip-sized "pico" projectors now found in some cell phones.

The system beams a keyboard or menu onto the user's forearm and hand from a projector housed in an armband. An acoustic detector, also in the armband, then calculates which part of the display you want to activate.

But how does the system know which icon, button or finger you tapped? Chris Harrison at Carnegie Mellon University in Pittsburgh, Pennsylvania, working with Dan Morris and Desney Tan at Microsoft's research lab in Redmond, Washington, exploits the way our skin, musculature and skeleton combine to make distinctive sounds when we tap on different parts of the arm, palm, fingers and thumb.

Bone machine

They have identified various locations on the forearm and hand that produce characteristic acoustic patterns when tapped. The acoustic detector in the armband contains five piezoelectric cantilevers, each weighted to respond to certain bands of sound frequencies. Different combinations of the sensors are activated to differing degrees depending on where the arm is tapped.

Twenty volunteers tested the system and most found it easy to navigate through icons on the forearm and tap fingers to actuate commands.

"Skin put works very well for a series of gestures, even when the body is in motion," the researchers say, with subjects able to deftly scroll through menus whether they moved up and down or flicked across their arm.

The system could use wireless technology like Bluetooth to transmit commands to many types of device – including phones, iPods and even PCs. The researchers will present their work in April at the ACM Computer-Human Interaction meeting in Atlanta, Georgia.

Body control

Pranav Mistry of the Media Lab at the Massachusetts Institute of Technology warns that users will have to position the armband very precisely so the projection always appears in the right place.

Nevertheless, Skinput looks a promising idea, says Michael Liebschner, director of the Bio-Innovations Lab at Baylor College of Medicine in Houston, Texas, who has worked on bone acoustic conduction technology for gadget-to-gadget transmission.

"This sounds a very feasible approach to using the body itself as an input device," he says. "When you are immersed in a virtual game using a head-mounted 3D display, you cannot just take it off to fiddle around with control buttons. This will make things much easier."

 

MINI2440 - touch screen device open with O_NONBLOCK

  
int ghw_touch_init()
 {
 
     ts_dev = open("/dev/input/event0",O_RDONLY|O_NONBLOCK);
    if (ts_dev < 0) {
       printf("Touch screen driver: open device error!\n");
         printf("errno = %d, %s\n", errno, strerror(errno));
        return 0;
    }
    else {
    printf (" Touch screen device opened successfully! \n");
     return 1;
  }
  
  /*flush the ts device */
    ts_flush(ts_dev);
}
 
/*
   Simulates a touch screen input.
   Returns 1 if a touch change (an edge or pressed coordinate change) has
been detected
   Returns 0 if the touch level is unchanged (unpressed, or pressed and
same position).
 
   Edge  = 1, Touch screen pressed changed state.
    Edge  = 0, No touch state change.
   Level = 1, Touch screen pressed (x,y values is a (new) valid position)
   Level = 0, No touch (x,y values is the position where last touch
 stopped)
*/
  
unsigned char ghwTouchGet(unsigned char *edgep, unsigned char *levelp, int
*xp, int *yp)
{
   int nRet=0; // index;
   static unsigned short oldx = 0;
   static unsigned short oldy = 0;
   int x;
   int y;
   struct ts_sample; // *ts_samples_data, ts_data;
 
  screen_pressed=0;
  
  /* read samples from touch screen device */
  /*do {
    if (ts_input_read(ts_dev, &ts_samples[0], 1) < 0) {
      perror("ts_read");
      //close_framebuffer ();
      return;
    }
 
  } while (ts_samples[0].pressure == 0);
*/
  /* Now collect up to MAX_SAMPLES touches into the samp array. */
/*  index = 0;
  do {
    if (index < MAX_SAMPLES-1)
      index++;
    if (ts_input_read(ts_dev, &ts_samples[index], 1) < 0) {
       perror("ts_read");
      //close_framebuffer ();
      return;
    }
  } while (ts_samples[index].pressure > 0);
  printf("Took %d samples...\n",index);
 */
   getxy (ts_dev, &x, &y);
 
 //   printf ("After ts read: x=%d, y=%d \n", x, y); 
 
  /* check if x, y has changed from previous values
   * update return values accordingly
   */
  if (screen_pressed == 0) {
 
    *edgep = 0;
    *levelp = 0;
    *xp = oldx;
    *yp = oldy;
    return 0;
  }
  else {
    *edgep = screen_pressed;
    if (x != oldx || y != oldy) {
    
      *levelp=1;
      if (xp != NULL)
             *xp = (int)(x*xres/x_ts_max);
        if (yp != NULL)
           *yp = (int) (y*yres/y_ts_max);
         
      /* x = oldx; */ /* x pos */  /* we need to remap the touch screen
co-ordinates to frame buffer pixels */
        /* y = oldy; */ /* y pos */
      }
  }
 
    /* Change detection */
    nRet = (screen_pressed != 0 || (x != oldx) || (y != oldy)) ? 1 : 0; 
     oldx = x;
    oldy = y;
    return nRet;
}
 
 
int ts_input_read (int ts_dev, struct ts_sample *samp, int nr)
{
  struct input_event ev;
  unsigned char *p = (unsigned char *) &ev;
  int len = sizeof(struct input_event);
static struct ts_sample ts_temp;
  int ret = nr;
  int total = 0;
 
  while (total < nr) {
    ret = read(ts_dev, p, len);
    if (ret == -1) {
      if (errno == EINTR) {
        continue;
      }
      break;
    }
 
    if (ret < (int)sizeof(struct input_event)) {
      /* short read
       * restart read to get the rest of the event
       */
      p += ret;
      len -= ret;
      continue;
    }
 
       // printf("type= %d, code= %d, value=0x%04x\n", ev.type, ev.code,
ev.value);
        
        /*if event is SYNC event then dont have to record the x,y - just
 continue reading */
        if (ev.type == EV_SYN) continue;
        
    /* successful read of a whole event */
    if (ev.type == EV_ABS) {
      switch (ev.code) {
      case ABS_X:
        if (ev.value != 0) {
          samp->x = ts_temp.x = ev.value;
          samp->y = ts_temp.y;
          samp->pressure = ts_temp.pressure;
        } else {/*flush the ts device */
          fprintf(stderr, "tslib: dropped x = %d\n", ts_temp.x);
           continue;
        }
        break;
      case ABS_Y:
        if (ev.value != 0) {
          samp->x = ts_temp.x;
          samp->y = ts_temp.y = ev.value;
          samp->pressure = ts_temp.pressure;
        } else {
          fprintf(stderr, "tslib: dropped y = %d\n", ts_temp.y);
          continue;
        }
        break;
      case ABS_PRESSURE:
        samp->x = ts_temp.x;
        samp->y = ts_temp.y;
        samp->pressure = ts_temp.pressure = ev.value;
        samp->tv = ev.time;
        break;
        } /* end switch */
        
  #ifdef DEBUG
        fprintf(stderr, "RAW---------------------------> %d %d %d\n",
           ts_temp.x, ts_temp.y, ts_temp.pressure);