How to annotate an ODT document with custom XML tags

Replies: 8 - Last Post: August 05, 2009 04:26
by: Homembit
 
Posted: December 16, 2008 17:13 by Lucas de Oliveira
Hello there,
I'm very excited about the announcement that ODF 1.2 is much more flexible when it comes to metadata annotation. However, I don't see much documentation (or I'm terribly blind, in which case please forgive the inconvenience) about how to annotate an ODT document using custom XML (in my case I'll try to use an OWL ontology). I'm working on my master's thesis (which involves a bit of semantic annotation in documents and also document data extraction), and it would be awesome to show some use of the new features included in ODF 1.2. I've tried to add annotations to content.xml manually, but OOo just throws them away every time I save the doc. So I was wondering if there's a way to do that using the toolkit (not sure if ODFDOM would be the best bet, but since I'm a bit lost I couldn't forget to ask). Any ideas, links to documentation, etc. will be very helpful, and I'll be glad to sync up with you guys while the work is developed.

best regards!
 
Posted: December 16, 2008 22:21 by Svante Schubert
Hi lucasdeoliveira,

The reason you have not stumbled over documentation so far is that we are still in the process of finalizing this metadata feature for ODF 1.2.

The tweak to the RDF metadata feature is one of the last handful of changes to the ODF 1.2 specification (see
http://lists.oasis-open.org/archives/office/200812/msg00085.html), and therefore even in OpenOffice.org the feature has to be adapted and won't become fully available before OOo 3.2.

Regarding ODFDOM, the Java API of 0.6.x is currently generated from the ODF 1.1 grammar (RelaxNG schema), which includes neither xml:id attributes nor the RDF manifest feature, but we plan to deliver an ODFDOM 0.7 beta based on ODF 1.2 at the end of January 2009.
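
To make the role of xml:id a bit more concrete, here is a minimal sketch in Python (not ODFDOM's Java API) that opens a package and lists the elements in content.xml that carry an xml:id. The ids "uc-actor" and "uc-goal" and the tiny in-memory package are made up for illustration; a real ODT also contains a mimetype entry, a manifest and styles, which this sample leaves out.

```python
import io
import zipfile
import xml.etree.ElementTree as ET

TEXT_NS = "urn:oasis:names:tc:opendocument:xmlns:text:1.0"
OFFICE_NS = "urn:oasis:names:tc:opendocument:xmlns:office:1.0"
# ElementTree expands the predeclared xml: prefix to this namespace.
XML_ID = "{http://www.w3.org/XML/1998/namespace}id"

# Hypothetical content.xml fragment; the ids are illustration only.
SAMPLE_CONTENT = f"""<?xml version="1.0" encoding="UTF-8"?>
<office:document-content xmlns:office="{OFFICE_NS}" xmlns:text="{TEXT_NS}">
  <office:body><office:text>
    <text:p xml:id="uc-actor">Registered customer</text:p>
    <text:p>Plain paragraph without an id</text:p>
    <text:p xml:id="uc-goal">Place an order</text:p>
  </office:text></office:body>
</office:document-content>"""


def build_sample_package() -> bytes:
    """Build a zip holding only content.xml (not a complete, valid ODT)."""
    buf = io.BytesIO()
    with zipfile.ZipFile(buf, "w") as zf:
        zf.writestr("content.xml", SAMPLE_CONTENT)
    return buf.getvalue()


def elements_with_xml_id(package: bytes) -> dict:
    """Map each xml:id found in content.xml to the text of its element."""
    with zipfile.ZipFile(io.BytesIO(package)) as zf:
        root = ET.fromstring(zf.read("content.xml"))
    return {
        el.get(XML_ID): "".join(el.itertext())
        for el in root.iter()
        if el.get(XML_ID) is not None
    }


found = elements_with_xml_id(build_sample_package())
print(found)
```

For a document on disk, the same function would work on the bytes read from the .odt file, since an ODT is a zip package with content.xml at its root.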

I don't think we will already have a Java RDF parser in the first 0.7 release, but a convenient API for metadata is one of my personal favorite interests.

May I ask you, out of curiosity, about some more details of your thesis work and your desired scenario?

Regards,
Svante

 
Posted: December 17, 2008 11:57 by Lucas de Oliveira
Hi there Svante,

"(...)but the convenient API for metadata is one of my personal favorite interests."

right on, mate! :D

"May I ask you out of curiosity about some more details of your thesis work, your desired scenario?"

Sure! I'm putting together a few software engineering document templates (like use case specification, design specification, and so on) and I want to annotate some parts of them (text blocks, table cells/columns) with special tags that reflect properties/types in software engineering ontologies. Once the templates are ready, users could fill them in, generating software artifacts that would already be "semantically rich", and I would be able to extract that information with transformation scripts or DOM-related applications. This is just part of my idea.
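
A minimal sketch of the extraction side of this idea, in Python: annotated elements are identified by xml:id, and a separate RDF/XML file says which ontology type each id stands for. Everything specific here is an assumption for illustration: the ontology namespace http://example.org/se-ontology#, the ids, the simplified content fragment, and the single RDF/XML shape (rdf:Description with rdf:about and rdf:type) that the parser handles. A real RDF parser would be needed for arbitrary RDF/XML.

```python
import xml.etree.ElementTree as ET

RDF_NS = "http://www.w3.org/1999/02/22-rdf-syntax-ns#"
TEXT_NS = "urn:oasis:names:tc:opendocument:xmlns:text:1.0"
XML_ID = "{http://www.w3.org/XML/1998/namespace}id"

# Simplified stand-in for content.xml (hypothetical ids).
CONTENT = f"""<doc xmlns:text="{TEXT_NS}">
  <text:p xml:id="uc-actor">Registered customer</text:p>
  <text:p xml:id="uc-goal">Place an order</text:p>
  <text:p>Unannotated remark</text:p>
</doc>"""

# Hypothetical metadata file: links ids to types of a made-up ontology.
METADATA = f"""<rdf:RDF xmlns:rdf="{RDF_NS}">
  <rdf:Description rdf:about="content.xml#uc-actor">
    <rdf:type rdf:resource="http://example.org/se-ontology#Actor"/>
  </rdf:Description>
  <rdf:Description rdf:about="content.xml#uc-goal">
    <rdf:type rdf:resource="http://example.org/se-ontology#Goal"/>
  </rdf:Description>
</rdf:RDF>"""


def types_by_id(rdf_xml: str) -> dict:
    """Read rdf:Description/rdf:type pairs, keyed by the fragment id."""
    root = ET.fromstring(rdf_xml)
    result = {}
    for desc in root.findall(f"{{{RDF_NS}}}Description"):
        about = desc.get(f"{{{RDF_NS}}}about", "")
        rdf_type = desc.find(f"{{{RDF_NS}}}type")
        if "#" in about and rdf_type is not None:
            fragment = about.split("#", 1)[1]
            result[fragment] = rdf_type.get(f"{{{RDF_NS}}}resource")
    return result


def extract(content_xml: str, rdf_xml: str) -> list:
    """Return (ontology type, text) for every annotated element, in document order."""
    types = types_by_id(rdf_xml)
    root = ET.fromstring(content_xml)
    return [
        (types[el.get(XML_ID)], "".join(el.itertext()))
        for el in root.iter()
        if el.get(XML_ID) in types
    ]


artifacts = extract(CONTENT, METADATA)
for onto_type, text in artifacts:
    print(onto_type, "->", text)
```

Keeping the type assignments outside content.xml means the template text itself stays untouched; only the ids have to survive editing.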

So far, as a workaround, I was considering XHTML with some microformat-like attributes. Since OOo only exports files to XHTML and does not edit them, I don't think that's the best scenario.
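
For reference, a minimal sketch of what that XHTML workaround could look like on the extraction side, assuming the annotations are class tokens with a made-up "se-" prefix (for example "se-actor") and that the exported XHTML is well-formed XML. The class names and the sample page are hypothetical.

```python
import xml.etree.ElementTree as ET

XHTML_NS = "http://www.w3.org/1999/xhtml"
PREFIX = "se-"  # hypothetical marker for ontology-related class tokens

# Hypothetical exported page with microformat-like class attributes.
PAGE = f"""<html xmlns="{XHTML_NS}"><body>
  <p class="se-actor">Registered customer</p>
  <table><tr><td class="note se-precondition">Customer is logged in</td></tr></table>
  <p class="note">No ontology class here</p>
</body></html>"""


def annotated_fragments(xhtml: str) -> list:
    """Return (annotation name, text) for each class token starting with PREFIX."""
    root = ET.fromstring(xhtml)
    out = []
    for el in root.iter():
        for token in el.get("class", "").split():
            if token.startswith(PREFIX):
                text = "".join(el.itertext()).strip()
                out.append((token[len(PREFIX):], text))
    return out


fragments = annotated_fragments(PAGE)
print(fragments)
```

This works for text blocks and table cells alike, but the annotations only exist in the exported file, which is the limitation described above.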

This year's OpenOffice.org Conference included a presentation that looks very much like what I'm trying to accomplish. The title of the presentation is "Transforming an OWL Ontology to an OpenOffice Document Template" (by Massoud Toussi), and I've already got in contact with the presenter, although he didn't send me any further info (just links to the presentation, which I already had).

I would be grateful if you could point me in the right direction to tackle this scenario.

thanks a lot for your attention and keep up the good work!

cheers!
 
Posted: December 23, 2008 17:51 by Lucas de Oliveira
Hi there Svante,
Do you think my strategy is way too audacious? Do you see anything that's really unrealistic?

thanks a lot for your support!
merry Christmas and happy new year!
 
Posted: June 10, 2009 07:45 by jpullmann

Hello Svante,
there are some projects using ODF as an integrated data format to generate learning content and packages (SCORM) [1,2,3].
Here, annotations with metadata are crucial, but they are applied in a proprietary way due to the current lack of support in OpenOffice.org. You
mentioned enhancements planned for OpenOffice.org 3.2. Could you please post more details on them?
Many thanks
Jaroslav

[1] http://www.hs-fulda.de/index.php?id=6275&L=0&F=1
[2] http://www.eduworks.com/index.php/Research/XML-SCORM-Studio.html
[3] http://inka.fhtw-berlin.de/Herzog/HOME/2_Publikationen/2009_WI2009-RapidElearningAnsatzfuerSCORM.pdf
 
Posted: December 23, 2008 18:54 by Svante Schubert
Hi lucasdeoliveira,

Although I cannot see the whole picture of your software development scenario, it makes a lot of sense to annotate documents with semantics so that fragments of them are easier to process.

So from my point of view your plan does not seem audacious.
Just give your idea a chance, create a small prototype and see if it proves your concept.

Good luck! ;)
Svante
 
Posted: April 01, 2009 15:21 by lzomatos
Hello lucasoliveira.

Do you speak Portuguese?
I'm asking because of your name, and because I'm developing work related to your thesis.

What is your e-mail address?

Best regards,
Luiz.
 
Posted: August 04, 2009 17:40 by Lucas de Oliveira
hi lzomatos,
Yes, I do speak Portuguese (Brazilian). My email is lucasdeoliveira [at] gmail [dot] com

Hope you've found something interesting regarding this topic :)

cheers!
 
Posted: August 05, 2009 04:26 by Homembit
Hi Lucas,

+1 to receive info about your work.

homembit [at] gmail [dot] com.

Best,

Jomar