Announcement

**Ray** · Dec-13-2016, 02:28 AM

None of what you describe is typical, so I think the best way to proceed with diagnosing the problem is if you can send us the .docx files in question, and your .zcfg configuration file so we can try to reproduce the problem here.

Without seeing the files in question, I can't speculate what might be leading to the additional numbers you see being extracted. For example, if they are values from a hidden data field or otherwise.

Likewise with the summary text being extracted, I would have to see the document to determine what is extracted. Some data is easy to determine the order and layout from the file format structure, other data can be more difficult if they have floating layers or scripting that cannot be easily determined until rendering occurs.

The sooner you can get us the files (more than one DOCX if you want to show us how repetition of titles occur, and the ZCFG configuration file), the more data we have, the sooner we can hope to have a solution for you. You can find our email details in the Contact Us page.

**Russ Ballaam** · Dec-13-2016, 12:24 PM

Thanks Ray,

I have created a small Zip file and attached it to an e-mail which I have just sent to you at info@wrensoft.com. Hopefully, this will give you sufficient to come up with a fix.

I also spotted another issue to do with indexing v1.5 (Acrobat 6.x) PDFs, and have included this in the e-mail as well.

Many thanks and kind regards,

Russell

**Russ Ballaam** · Jan-02-2017, 09:50 PM

For anyone experiencing similar issues to those described above. The issue to do with replicated titles appearing in the DB was a failing on my part. The metadata in the original Office documents had not been updated, and we had been creating new documents with the same style as the original.

The issue to do with strange numbers appearing in the summary for Office documents was fixed by the 7.1.1011 release.

Thanks Ray,

Russ

Announcement

Problems indexing Office documents/extraction of metadata/plugins

Problems indexing Office documents/extraction of metadata/plugins

Comment

Comment

Comment