Previous in Forum: Aluminum Alloys Most Susceptible to Corrosion   Next in Forum: Pulse Width Modulator Question
Close
Close
Close
7 comments
Rate Comments: Nested
Commentator
Peru - Member - New Member

Join Date: Oct 2006
Location: Lima Peru
Posts: 57

Microsoft Word Hexadecimal Format

07/16/2010 4:23 PM

Dear CR4 followers:

Can somebody tell me the way Microsoft Word saves the text in hexadecimal? I am able to read it with a software tool but can´t find any correlation with the ASCII or UNICODE, is there an especial code for this purpose?

Thanks

Ciro

__________________
There is allways another way
Register to Reply
Pathfinder Tags: hexadecimal microsoft word
Interested in this topic? By joining CR4 you can "subscribe" to
this discussion and receive notification when new comments are added.
Guru
Technical Fields - Technical Writing - New Member Engineering Fields - Piping Design Engineering - New Member

Join Date: May 2009
Location: Richland, WA, USA
Posts: 21017
Good Answers: 795
#1

Re: Microsoft Word Hexadecimal Format

07/18/2010 11:31 AM

Each pair of hexadecimal digits constitutes a byte, which should represent an ASCII character by means of a simple look-up table. (I think.):

0 = 0000
1 = 0001
2 = 0010
3 = 0011
4 = 0100
5 = 0101
6 = 0110
7 = 0111
8 = 1000
9 = 1001
A = 1010
B = 1011
C = 1100
D = 1101
E = 1110
F = 1111

__________________
In vino veritas; in cervisia carmen; in aqua E. coli.
Register to Reply
Anonymous Poster
#2
In reply to #1

Re: Microsoft Word Hexadecimal Format

07/18/2010 1:14 PM

Yes Tornado, what you had mention is what I was specting to find, but when I look into the word file in its "Hexadecimal view" I do not see any code that I could correlate with the text in the "Word view" of the file. To investigate it, I did several experiments, for example, I prepare a word file with only "AAAAAAAAAAAAAAAAAAAAAAAAA" and open it in its hexadecimal view to see the way it is stored but can´t find the correlation with ASCII nor with UNICODE, this is the reason way I am looking for a hand on this mater. Thanks for your opinion and interest Ciro

Register to Reply
Guru
Australia - Member - New Member

Join Date: Feb 2008
Location: Australia
Posts: 2181
Good Answers: 255
#3

Re: Microsoft Word Hexadecimal Format

07/18/2010 5:56 PM

I'll reply in this thread since it seem to be the original.

You may never see the "hex" characters that you expect in any output file if you are actually seeing the binary stored data converted to hex code.

This is stretching my memory from looooong ago but it goes a little like this.

The data will be broken into smaller portions for storage. In order to overcome the potential failure of a single bit in either storage, retrieval or data transfer, the system will inject a few extra bits that allow either error detection or even error correction within the data. From memory these are called "hamming" bits.

They are not injected at the end of the data stream, but rather at various bits within each "word". When you read the raw binary information, you will see not only the binary data, but also these additional bits. When that binary iss converted to Hex, it will not necessarily look anything like what you expected.

I might be completely off with this, as it is years since being involved in this sort of stuff.

__________________
Just an Engineer from the land down under.
Register to Reply
Commentator
Peru - Member - New Member

Join Date: Oct 2006
Location: Lima Peru
Posts: 57
#4
In reply to #3

Re: Microsoft Word Hexadecimal Format

07/18/2010 6:54 PM

Dear Just an Engineer: Yes, I know the bit or bits that are added to the data in transmiting the data but I did not knew that it is also used to strore data, any way I can assume that, for example, in the word file with the "AAAAAAAAAAAA" the code of an "A" plus the added bits should be always the same, so I should be able to find the repeting pattern but it it not the case. Thanks Ciro

__________________
There is allways another way
Register to Reply
Guru
Australia - Member - New Member

Join Date: Feb 2008
Location: Australia
Posts: 2181
Good Answers: 255
#5
In reply to #4

Re: Microsoft Word Hexadecimal Format

07/19/2010 1:58 AM

You will not necessarily see the repeating pattern. The Hamming bits are added within the code, not at the end and some software we were touching on 30 years ago was able to recognise "repeat" characters and store it as "10 x A" instead of "AAAAAAAAAA" to save memory space.

Good luck

__________________
Just an Engineer from the land down under.
Register to Reply
Commentator
Peru - Member - New Member

Join Date: Oct 2006
Location: Lima Peru
Posts: 57
#6
In reply to #5

Re: Microsoft Word Hexadecimal Format

07/19/2010 11:16 PM

Thanks for you comments Ciro

__________________
There is allways another way
Register to Reply
Member

Join Date: Feb 2015
Posts: 6
#7

Re: Microsoft Word Hexadecimal Format

03/06/2015 5:00 AM

I found a related tutorial here:

http://www.instructables.com/id/Unprotect-MS-word-Doument./?lang=zh

Hope it will be helpful.

__________________

Best regards.

Micagordon

Register to Reply
Register to Reply 7 comments
Copy to Clipboard

Users who posted comments:

Anonymous Poster (1); ciro (2); Just an Engineer (2); micagordon (1); Tornado (1)

Previous in Forum: Aluminum Alloys Most Susceptible to Corrosion   Next in Forum: Pulse Width Modulator Question
You might be interested in: Bar Code Scanners, Bar Code Cards, Bar Code Software

Advertisement