Jump to content United States-English
HP.com Home Products and Services Support and Drivers Solutions How to Buy
» Contact HP

HP.com home


HP Labs India

Research - Paper in the Digital Enterprise Project

» 

HP Labs

» Research
» News and events
» Technical reports
» About HP Labs
» Careers @ HP Labs
» People
» Worldwide sites
» Downloads
Content starts here
Intelligent Document Cleanup
 

Document cleanup is a necessary preprocessing step for effective digitization of paper. Our research deals with the whole gamut of issues related to document cleanup including: How do you determine the type of noise that is present in a document? How do you optimize the cleanup of the documents to make them more suitable for downstream processing? How do you evaluate the goodness of the cleanup that has been carried out on a document? Our aim is to come up with a holistic approach for document cleanup that would work across the whole range of documents with all types of potential noise.

 

Paper as an Interface
 

Handwritten annotations are often used in offices and business processes. While moving from the digital world to the paper world is relatively easy moving the other way around for capturing handwritten information and annotations is a hard problem.

Our work focuses on intelligently segregating the ink from the paper document and recognizing it accurately. How can you distinguish handwritten text from printed text effectively? Also how well can you recognize the extracted handwritten ink? While OCR of printed text for the Roman script has achieved high-level of performance, there is still substantial work required for handwritten text.


Paper Widgets
 

Enterprises have significant problems processing paper documents. There is significant amount of time and effort spent in terms of transcribing information from paper documents into IT systems. Our research here tries to come up with new ways to alleviate some of these problems. It includes work on new forms of machine readability and associated document image processing to help detect and deal with these new forms. We feel the research can ultimately have significant impact in terms of enterprise scanning workflows.

 

Secure Print Workflows
 

Typically the paper document is the weakest security link in enterprise’s information flows. The research on secure print workflows aims to provide the same level of security to the printing process that one has in terms of access to digital information in an enterprise. The research aims to come up with methods by which unauthorized printing can be prevented and even when it has been carried out can be subsequently detected.

 

 


  back     top
 

Home

 

Who We Are

     
Director's Message
Director's Biography
Our People
 

Research

     
Paper in the Digital Enterprise
Intuitive Multimodal and Gestural Interaction
Simplifying Web Access for the Next Billion
Technology in Education
   

Opportunities

   
Careers
Internships
 

Collaboration

   
Universities
BITS-HP Labs India PhD Fellowship
IIITB-HP Labs India PhD Fellowship
   

News & Information

   
Lectures
Workshops and Conferences
Awards
Publications
Press
 

Downloads

   
Lab Brochure
Whitepapers
Demos
 
 

Contact Us

 

     
     
Printable version This page was last updated on May 21, 2009
Privacy statement Using this site means you accept its terms Feedback to HP Labs
© 2009 Hewlett-Packard Development Company, L.P.