File Parsers

File parsers extract text from various file types and provide it as a Reader to the FullTextBridge for use during search indexing. Plain text formats can use the default fileParser bean, but any specialized format, such as PDF or RTF, requires special libraries and special registration.

Configuration

Currently, configuration takes place solely in the file source:trunk/components/server/resources/ome/services/service-ome.api.Search.xml. Eventually, it should be possible to replace file parsers at configuration time or even at runtime.

Available parsers

File type          Parser
application/pdf    http://pdfbox.org
text/xml           (internal)
text/plain         (internal)
text/csv           (internal)
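
The mapping in the table above can be pictured as a lookup from MIME type to parser, with every plain text type sharing one default parser. The following is only a minimal sketch under stated assumptions: ParserRegistrySketch, TextParser, register, withInternalParsers and readerFor are hypothetical names chosen for illustration and are not taken from the OMERO sources, and UTF-8 is assumed as the file encoding.

```java
import java.io.File;
import java.io.IOException;
import java.io.Reader;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.util.HashMap;
import java.util.Map;
import java.util.Optional;

// Hypothetical sketch of MIME-type dispatch; names are illustrative, not OMERO's.
class ParserRegistrySketch {

    // Hypothetical contract: turn a file into a Reader over its text.
    interface TextParser {
        Reader parse(File file) throws IOException;
    }

    private final Map<String, TextParser> byMimeType = new HashMap<>();

    // Registers a parser for one MIME type, replacing any earlier entry.
    void register(String mimeType, TextParser parser) {
        byMimeType.put(mimeType, parser);
    }

    // Builds a registry in which the three plain text types from the table
    // share a single default parser (assumption: files are UTF-8 encoded).
    static ParserRegistrySketch withInternalParsers() {
        ParserRegistrySketch registry = new ParserRegistrySketch();
        TextParser plain =
            file -> Files.newBufferedReader(file.toPath(), StandardCharsets.UTF_8);
        registry.register("text/plain", plain);
        registry.register("text/xml", plain);
        registry.register("text/csv", plain);
        return registry;
    }

    // Returns a Reader over the file's text, or an empty Optional when no
    // parser has been registered for the given MIME type.
    Optional<Reader> readerFor(String mimeType, File file) throws IOException {
        TextParser parser = byMimeType.get(mimeType);
        if (parser == null) {
            return Optional.empty();
        }
        return Optional.of(parser.parse(file));
    }
}
```

In this sketch application/pdf is deliberately left unregistered, since a PDF parser depends on an external library; it would be added with a further register call.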

How to

For the necessary changes, see r2260. The base class for file parsers is: source:trunk/components/server/src/ome/services/fulltext/FileParser.java
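
To give a sense of what a parser for a specialized format has to do, here is a minimal sketch that extracts the text of an RTF file and hands it back as a Reader. It rests on assumptions: it uses the JDK's own javax.swing.text.rtf.RTFEditorKit rather than any library OMERO is known to use, RtfTextSketch and its parse method are hypothetical names, and the class does not extend the real FileParser, whose methods are not reproduced here.

```java
import java.io.File;
import java.io.FileInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.Reader;
import java.io.StringReader;
import javax.swing.text.BadLocationException;
import javax.swing.text.Document;
import javax.swing.text.rtf.RTFEditorKit;

// Hypothetical sketch: not OMERO code and not a subclass of the real FileParser.
class RtfTextSketch {

    // Reads an RTF file with the JDK's RTF editor kit and returns a Reader
    // over the extracted plain text, the form consumed during indexing.
    static Reader parse(File file) throws IOException {
        RTFEditorKit kit = new RTFEditorKit();
        Document doc = kit.createDefaultDocument();
        try (InputStream in = new FileInputStream(file)) {
            // Populate the document model from the RTF stream.
            kit.read(in, doc, 0);
            // Copy out the text content, dropping all formatting.
            return new StringReader(doc.getText(0, doc.getLength()));
        } catch (BadLocationException e) {
            throw new IOException("Could not extract text from " + file, e);
        }
    }
}
```

The sketch holds the whole extracted text in memory, which keeps it short; a parser meant for large files would more likely stream its output.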


See also: OmeroSearch
