How to get rid of junk OCR character leftover in Word
Thread poster: Susan Welsh

Susan Welsh | United States | Russian to English
I have converted a PDF to Word using ABBYY FineReader, and wherever there was a hyphen at a line ending, the Word version has put in a junk character that I cannot search and replace to get rid of. It looks like a horizontal line with a short vertical line hanging down from the back of it -- like an L rotated 90 degrees clockwise. I have copied it into my Find field, but Word can't find it. There are hundreds of these things in this rather long document, and I would really like to get a clean text to make translating easier. Any suggestions? Thanks in advance!

Kevin Fulton | United States | German to English
Look under special characters | Apr 16, 2013
If I recall correctly, this is the optional hyphen, which Word's Find box matches as ^-.

Sam Pinson | United States | Member (2011) | Russian to English

LEXpert | United States | Member (2008) | Croatian to English
This is very common in multi-column articles. Open Word's Find & Replace dialog. Under Find, click the "More >>" button. Place the cursor in the Find box, and from the Special drop-down menu select "optional hyphen". Leave the Replace box blank. Replace All. That's it.
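For anyone who would rather script the cleanup outside Word, here is a minimal Python sketch of the same find-and-delete idea. It assumes the text has already been saved as plain text and that the optional hyphens survived as the Unicode soft hyphen (U+00AD); whether a given export actually preserves them that way is not confirmed in this thread. The function name and the `extra` parameter are made up for illustration.

```python
# Hypothetical helper: delete optional (soft) hyphens from plain text.
# Assumption: the optional hyphens are encoded as U+00AD SOFT HYPHEN.
SOFT_HYPHEN = "\u00ad"


def strip_optional_hyphens(text: str, extra: str = "") -> str:
    """Return text with soft hyphens removed.

    Any characters passed in `extra` are removed as well, e.g. "\u00ac"
    if the junk character turns out to be a literal not sign.
    """
    # Mapping a code point to None tells str.translate to delete it.
    table = {ord(ch): None for ch in SOFT_HYPHEN + extra}
    return text.translate(table)


if __name__ == "__main__":
    sample = "a clean text makes trans\u00adlating easier"
    print(strip_optional_hyphens(sample))
```

Ordinary hyphens (as in "multi-column") are left alone, because only the characters listed in the table are deleted.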
esperantisto | Member (2006) | English to Russian
Better take care of it in FR | Apr 16, 2013
In FineReader, go to Tools → Options → 4. Save → Format Settings → RTF/DOC/Word XML, tick Remove Optional Hyphens, and then re-export your document.
[Edited at 2013-04-16 07:57 GMT]

Susan Welsh | United States | Russian to English | TOPIC STARTER
I used Rudolf's solution, and it worked like a charm. (I didn't want to go back to FR, because I had already done some formatting work on the Word file, like moving footnotes around.) Thanks to all.