04 March 2013

MVCOM Email Parsing Component: What Content/Meta-data Does It Track

MVCOM is a COM component that software developers use to manage email content stored in Outlook .pst, .msg, or .eml files. It's the same technology used in Encryptomatic's email viewers, like Pst Viewer Pro.

Today a customer asked: 
When the component extracts a pst file, does it write to a temporary table in memory?

During the extract, what content/metadata of each file does it track? 


MVCOM component keeps temporary data in memory.
Such temporary data includes:
  • physical structure binary tree, also known as Node Database Layer.
  • logical structure binary tree, also known as Lists, Tables, and Properties (LTP) Layer
  • object structure, such as Messages/Folders objects, etc
  • Temporary data uses lazy initialization tactic of creation.
The temporary data table is empty after opening a PST/OST file, then it is being filled, step by step depending of course which data an application wants to extract.
For each PST/OST file MVCOM tracks header information, like:
  • Format version
  • ANSI/Unicode type
  • Encryption method
  • Store object location
  • etc
This information is low level and not available to end users.

For each message inside PST/OST the full set of content/metadata can be extracted by user.
The most common content can be extracted directly, for example
  • Subject
  • Sent/Receive datetime
  • From,
  • To/CC/BCC lists
  • message Body
  • Email file attachments and more.

MVCOM can read all native MAPI properties, so you can extract metadata, such as
  • Transport Headers,
  • Priority,
  • Labels,
  • ThreadIndex and more
  • (for additional information please refer MVCOM.MailItem.Properties method)

No comments:

Post a Comment