XML filetype parser with embedded html
Thread poster: foremonly
Jan 8, 2013

Hello,
I have a few questions.

1. Is it possible to change segmentation rules for certain nodes within a parser? For example, I have an XML file with a list of keywords separated by commas (e.g. keyword1,keyword2,keyword3). Is it possible to have these segmented at the commas? In the editor view, I would like to have each keyword in a separate segment, with the commas excluded, like this:

keyword1
keyword2
keyword3

Please note that the keywords are found in a keyword node. The other nodes in the XML do not need to be parsed in this way.

This leads me to my second question...

2. Is it possible to set up sub-parsers? For example, I want to parse the content in one node differently from another node and have the text segmented differently.

3. For XML files with embedded HTML, is it possible to import the settings from the HTML filetype into the XML filetype instead of manually setting embedded tag rules? It seems like a lot of work to manually add every possible HTML tag to the XML filetype when the default settings for the HTML filetype seem to be very comprehensive.

I would be very grateful if someone could provide some insight on these questions.

Thank you!
Emily



SDL Community
United Kingdom
Tricky questions! Jan 9, 2013

Hi Emily,

I created a test file like this (maybe not exactly what you have but it may give you some ideas):


Then I created a filetype that does this:


So not perfect, but it may be enough to provide you with what you need, or a few examples for starters. The parser rules I used were these:


Note that I added some context to li, which contains the lists separated by commas. This allowed me to then add two embedded content rules like this:


The first rule captures any tag at all in the CDATA section where I placed my HTML content, and the second was simply a comma. I then made the comma a non-translatable placeable and excluded it so it was held outside the segments:


So, not the perfect embedded HTML solution you were after, but perhaps overall something you can work with?
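To show the idea behind the two rules outside of Studio, here is a minimal Python sketch. It is only an illustration, not the filetype settings themselves: the sample XML, the element names `description` and `keywords`, and the function names are all hypothetical, and the regular expressions are my assumption of what "any tag at all" and "simply a comma" would look like.

```python
import re
import xml.etree.ElementTree as ET

# Hypothetical sample file: HTML inside CDATA, plus a comma-separated list in <li>.
SAMPLE = """<doc>
  <description><![CDATA[<p>Some <b>bold</b> text</p>]]></description>
  <keywords><li>keyword1,keyword2,keyword3</li></keywords>
</doc>"""

# Assumed pattern for "any tag at all"; the second rule is just a literal comma.
TAG_RULE = re.compile(r"<[^>]+>")
COMMA_RULE = re.compile(r",")


def tokenize(text):
    """Split embedded HTML into ("tag", ...) and ("text", ...) tokens."""
    tokens = []
    pos = 0
    for match in TAG_RULE.finditer(text):
        if match.start() > pos:
            tokens.append(("text", text[pos:match.start()]))
        tokens.append(("tag", match.group()))
        pos = match.end()
    if pos < len(text):
        tokens.append(("text", text[pos:]))
    return tokens


def split_keywords(text):
    """Split on the comma rule; the commas themselves are left out."""
    return [part.strip() for part in COMMA_RULE.split(text) if part.strip()]


def extract(xml_text):
    """Apply the tag rule to the CDATA node and the comma rule only to <li>."""
    root = ET.fromstring(xml_text)
    html_tokens = tokenize(root.find("description").text or "")
    keywords = []
    for li in root.iter("li"):
        keywords.extend(split_keywords(li.text or ""))
    return html_tokens, keywords


if __name__ == "__main__":
    tokens, keywords = extract(SAMPLE)
    print(tokens)
    print(keywords)
```

The point of restricting the comma rule to `li` is the same as adding context in the parser rules: commas elsewhere in the file are left alone.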

Regards

Paul


foremonly
TOPIC STARTER
Thanks! Jan 9, 2013

Thanks, Paul! Helpful as always!

I will have a look and see how it goes.

