下面列出的网站内容(列出数据存储内容和可用组件界面允许列出界面)

优采云 发布时间: 2022-04-01 09:31

  下面列出的网站内容(列出数据存储内容和可用组件界面允许列出界面)

  14.4 列出数据存储内容和可用组件

  命令行界面允许列出数据存储内容和可用组件。如果需要,它的预期用途是帮助手动编辑分析文件。通过使用 -list 参数,您可以获得数据存储的元数据和允许您手动编写分析文件的 DataCleaner 组件。

  如果您查看 -usage 命令的输出,列出数据存储的内容非常简单。以下是使用示例数据库“orderdb”的几个示例:

  > datacleaner-console.exe -list datastores

Datastores:

-----------

Country codes

orderdb

> datacleaner-console.exe -list tables -ds orderdb

Tables:

-------

CUSTOMERS

CUSTOMER_W_TER

DEPARTMENT_MANAGERS

DIM_TIME

EMPLOYEES

OFFICES

ORDERDETAILS

ORDERFACT

ORDERS

PAYMENTS

PRODUCTS

QUADRANT_ACTUALS

TRIAL_BALANCE

> datacleaner-console.exe -list columns -ds orderdb -table employees

Columns:

--------

EMPLOYEENUMBER

LASTNAME

FIRSTNAME

EXTENSION

EMAIL

OFFICECODE

REPORTSTO

JOBTITLE

12345678910111213141516171819202122232425262728293031323334

  列出 DataCleaner 的组件是通过将 -list 参数设置为以下三种组件类型之一来完成的:ANALYZER、TRANSFORMER 或 FILTER:

  > datacleaner-console.exe -list analyzers

...

name: Matching analyzer

- Consumes multiple input columns (type: UNDEFINED)

- Property: name=Dictionaries, type=Dictionary, required=false

- Property: name=String patterns, type=StringPattern, required=false

name: Pattern finder

- Consumes 2 named inputs

Input column: Column (type: STRING)

Input column: Group column (type: STRING)

- Property: name=Discriminate text case, type=Boolean, required=false

- Property: name=Discriminate negative numbers, type=Boolean, required=false

- Property: name=Discriminate decimals, type=Boolean, required=false

- Property: name=Enable mixed tokens, type=Boolean, required=false

- Property: name=Ignore repeated spaces, type=Boolean, required=false

- Property: name=Upper case patterns expand in size, type=boolean, required=false

- Property: name=Lower case patterns expand in size, type=boolean, required=false

- Property: name=Predefined token name, type=String, required=false

- Property: name=Predefined token regexes, type=String, required=false

- Property: name=Decimal separator, type=Character, required=false

- Property: name=Thousands separator, type=Character, required=false

- Property: name=Minus sign, type=Character, required=false

...

> datacleaner-console.exe -list transformers

...

name: Tokenizer

- Consumes a single input column (type: STRING)

- Property: name=Delimiters, type=char, required=true

- Property: name=Number of tokens, type=Integer, required=true

- Output type is: STRING

name: Whitespace trimmer

- Consumes multiple input columns (type: STRING)

- Property: name=Trim left, type=boolean, required=true

- Property: name=Trim right, type=boolean, required=true

- Property: name=Trim multiple to single space, type=boolean, required=true

- Output type is: STRING

...

123456789101112131415161718192021222324252627282930313233343536373839404142434445

  点击这里返回DataCleaner文档的主目录

0 个评论

要回复文章请先登录注册


官方客服QQ群

微信人工客服

QQ人工客服


线