Removing Tags from a Translation Memory

Earlier this week a colleague contacted me with an issue. One of his agency clients sent him a Translation Memory, but much to his dismay, the TM was cluttered with unnecessary formatting tags that rendered it pretty much useless. This clutter is usually being referred to as a “Tag soup”, and it is primarily the result of a PDF or image file conversion into formatted text. A Tag Soup could also form out of style mismatch when copying and pasting formatted text between applications, for example, copying an HTML email message and pasting it into a Word Processor. The conversion process or style mismatch introduces all kinds of unnecessary formatting tags that need to be removed in a post-processing stage before the converted document can be worked on. Emma Goldsmith wrote an excellent and exhaustive article about how to get rid of a tag soup in documents before starting to work on them, but what if the tag soup is already in the Translation Memory?Continue reading →

Counting Inserted and Deleted Words in Track Changes

While pondering which topic should I write about first, along came a project, and with it an interesting challenge that I thought could turn into an article.

In some highly regulated industries such as Legal and Pharmaceuticals/Healthcare it is not uncommon to get requests for updating an existing translation where the edits in the new version of the source document are indicated by the Track Changes feature.
This type of project presents a unique challenge for quoting, scheduling and processing.Continue reading →

Introductory Post

Hi Everyone,
This is just a short introductory post to start things off. You can find more details about this blog in the About Translation Therapy page.

In summary, in this blog I will attempt to cover all sorts of relevant content to the translation profession and work. From topics about the business/commercial side of things, through technical issues and challenges, to any other idea, thought or subject that are, or might be, relevant.
I also wish for this blog to become a worthy supplement to the invaluable information and advice provided by other great translation related blogs (a representative list of which can be found in the About Translation Therapy page) which I follow, enjoy and draw inspiration from.

Although I would like to dive in and experiment with the design and adding/removing elements and features, my main priority is start adding some content. But little by little I will certainly work work towards tweaking and optimizing the blog in terms of design, functionality, and performance.

Please feel free to register to the RSS Atom feedFollow me [@HiFiText] on Twitter, connect with me on LinkedIn (all links are also available at the footer of the webpage), contact me via this blog (the contact form will be added soon) or leave a comment with feedback, suggestions, or anything else.

I hope that you will enjoy the read.