Tagsets and Scanners Included in the Distribution

By default, ITPilot includes the following built-in tagsets:

  • All4_6. Used by default by the process which generates DEXTL programs using examples. It defines a tag for every HTML tag.
  • All4_6_xxx. A set of pre-generated tagsets that represents the most commonly used tagsets. They define a tag for every HTML tag except for those that appear in their name. For example, all4_6_anchor recognizes all the HTML tags except the anchor tag <a>.

By default, ITPilot includes the following built-in scanners:

  • StandardFullLexer4_6. Used by default by the process which generates DEXTL programs using examples. Uses the default tagset all4_6. Makes use of the lexer type that keeps blank spaces between tags (see section Lexer Types).
  • AutogeneratedLexer4_6_x. A set of pre-generated scanners that represents the most commonly used scanners. They use a subset of the tagsets previously mentioned. Make use of the lexer type that keeps blank spaces between tags (see section Lexer Types).

Due to backwards compatibility reasons all the scanners of earlier versions of ITPilot are included. Some of these scanners are considered deprecated and should not be used in new projects: StandardHTMLLexer, StandardHTMLLexerJS, StandardFormLexer, StandardFormLexerJS and StandardLexerJS.