Multiple Split Methods
Split Every nth Element
The splitter creates a new split file every nth element at the specified depth.
Split When An Element Name Changes
Creates a new split file when the name of an element at the specifed depth changes.
Split When The Value Of Specified Attribute Changes
The splitter creates a new split file upon change in value of the attribute in an element at the specifed depth.
Split When Namespace Changes
Creates a new split file when the namespace in scope changes.
Split When a Comment, CDATA or ProcessingInstruction Occurs
Accepts a list containing any of these node types and creates a new split file when one of the listed node types
occurs and optionally contains specified text.
|
Useful Options
New: Preserve Structure.Ensures that each split file has the same structure as the source XML.
Preview Mode. The Wizard creates and displays the first split file only.
Header Element. Includes the
first element under the root (header) in each split file.
Read our blog article...
Depth. Specifies the element depth in the XML hierarchy for inclusion.
Root Element. Encapsulates each split file with the specified
root. If it has attributes, it automatically handles the quotes so the entire root is properly quoted for the script engine.
Include File. The specified file is inserted in each split file.
One use is to ensure each split file has the same structure as the source XML.
Append File. Inserts the specified file at the end of each
split file. When used with an Include File, each split file may be nested within multiple parent elements.
Threshold Element. Specifies the element in the source file at which the splitter begins processing, skipping over all preceding nodes.
Encoding. Specifies the encoding used to write the split files. utf-8, utf-16 and iso-8859-1 are currently supported.
Write Byte Order Mark. Specifies whether the splitter writes a byte order mark in each split file. This is useful when feeding the split files into other software that may either require it, or fire an exception if it occurs.
Write DOCTYPE. If a DOCTYPE node occurs, specifies if the splitter writes it in each split file, first only, or none. This is useful where a DTD containing named entities may not be available or needed.
|
How Does It Work?
XmlSplit uses an XmlReader to read and parse the input XML document. It evaluates the input parameters when each
node is read to determine if the node is to be written to the current split file or a new split file created.
Auto-numbered split files are named based on an output file parameter. It automatically handles file names with
spaces by encapsulating them with quotes.
Scripts may be used with Powershell and Windows Script Host or XmlSplit may be invoked with the Windows Command (Run).
Many XmlSplit customers receive large XML files electronically and need to split and import them into database tables. Calling an XmlSplit script from
another script allows the entire process to be fully automated.
XmlSplit will split XML files of any size, and was successfully used by a New Jersey
consulting firm to divide a single XML file 60 Gigabytes in size.
Rate of execution is constant with respect to size of the file being split because only a small segment is read into memory.
Use of the Wizard is optional. It automatically generates scripts for using XmlSplit in command-line mode. The Wizard also allows the XML to be split
directly without using a script, reports progress during the split and enables cancellation at any time.
|