Data Record Extractor Online Help
About Data Record Extractor
Introduction
Copyright Declaration
Registration & Order
How To Use
Overview
Extraction Templet
Insert Auto Increment Field
Insert A Database Field
Set Authorization/Login
Add A Column For Text Tag
Add A Column For RegExpr
Database Management
Create/Drop A Table
Create A Unique Field
Add Content To A Field
Reset Extraction Flag
Output Extraction Result
Save To Database
Save As .CSV File
Custom Output
Menu Description
Install & Uninstall
Install & Uninstall
FAQs
Frequently Asked Questions
Appendix
Syntax of Regular Expressions

Add A Column For Text Tag Extraction

Tex Tag is characters that separates the data from other wild characters, in web extracting, often known as HTML tags. For example: <td>THE DATA</td>, here, <td> is open tag, </td> is close tag. A pair of text tag corresponding to column, a serials of text tags composed a record, a record showing on grid known as a row.

Column Title

Column title is the name of the column, shows at column header of the result grid. If the checkbox Start/Record tag is checked, the meaning is Start Tag. Start Tag indicates the start position that the program start to extract, all characters before the start tag will be jumped over.

Open Tag

No more than 50 characters. There are several program pre-defined open tag, they have special format and meaning:

  • [..]: The open tag is the same as last open close tag, like this:

    DATA_1#DATA_2#DATA_3, the # act as both open tag and close tag, in this situation, you should write open tag as [..] and close tag as #.

  • X...Y: X & Y is one or more characters, this expression meaning "from X until Y to start", like this:

    <td width="100" class="xxx">THE DATA</td>
    You need to extract THE DATA, you can write the open tag as <td...> and the close tag as </td>

  • X{n}...Y: Like X...Y, just passby X n times, like this:

    <td xxx>data you needn't</td><td xxx>data you needn't</td><td xxx>THE DATA</td>
    You can write the open tag as <td{3}...> and close tag as </td>

When checkbox Start/Record Tag is checked, the open tag edit box act as record start tag, but what is record start tag? See below:

<tr><td xxx>Jimmy</td><td xxx>25</td></tr>
<tr><td xxx>Jennifer</td><td xxx> 20</td></tr>

You can set the record start tag as <tr> and the record end tag as </tr>, the record tag is optional, but it is helpful to make extraction more accurate.

Close tag

Unlike the open tag, close tag is simple. Only one pre-defined Carriage-Return & Line Feed(CRLF, also known as Enter), you can select from the drop down list.

Table field (optional)

If you want the extraction result save to database table, you should associatea extraction templet with a table and select a field for every column. Click here to learn about how to create a table.