Difference between revisions of "Datainterchange DSL"
(→entity) |
(→createOn) |
||
Line 231: | Line 231: | ||
For more information about the entities, review the corresponding [http://download.osbee.org/documentation/index.php/Entity_DSL#Entities Entity DSL documentation]. | For more information about the entities, review the corresponding [http://download.osbee.org/documentation/index.php/Entity_DSL#Entities Entity DSL documentation]. | ||
− | ====createOn==== | + | ====Entity Keywords==== |
+ | |||
+ | With the <code>entity</code>, the user essentially specifies the projection of a source data file to the entities. This projection can be further fine-tuned with the following keywords. | ||
+ | |||
+ | =====createOn===== | ||
The <code>createOn</code> keyword followed by a string specifies on which input element should an entity to be created. | The <code>createOn</code> keyword followed by a string specifies on which input element should an entity to be created. |
Revision as of 16:03, 1 September 2017
A general introduction and some high-level concepts for OSBP DSLs can be found in OSBP DSL Documentation.
Please note that you can only make use of the Datainterchange DSL by downloading its packages (shown below on both Figure 1 and 2) from the download section of our website.
Contents
- 1 Purpose
- 2 Overview
- 3 Data Interchange Grammar Definition
- 4 Data Interchange Model File
- 4.1 Reserved Keywords
- 4.1.1 import
- 4.1.2 package
- 4.1.3 titel
- 4.1.4 interchange
- 4.1.5 describedBy
- 4.1.6 persist, merge, remove
- 4.1.7 file
- 4.1.8 elementSize
- 4.1.9 delimiter
- 4.1.10 skipLines
- 4.1.11 report
- 4.1.12 indent
- 4.1.13 quoteCharacter
- 4.1.14 encoding
- 4.1.15 beans
- 4.1.16 entity
- 4.1.17 Entity Keywords
- 4.1.18 expression -> assign...with...as | copy...from...property
- 4.1.19 format -> for...coding...locale
- 4.1.20 keys -> key
- 4.1.21 mapping -> map...to
- 4.1.22 lookup -> for...on...createOn...with...cacheSize...mapTo
- 4.1.23 marker -> makerPath -> markerEntity | markedBy
- 4.1 Reserved Keywords
- 5 Enums
- 6 Configuration Settings / Smooks Framework (A small introduction)
- 7 TriggerView & Executorservice
- 8 Annotations
- 9 Comments
- 10 Copyright Notice
Purpose
The Data Interchange DSL (datainterchange for short) is made for defining data exchange models that can be used to import data from various formats (CSV, XML, EDI, etc.), map the data to entities, store them into database, or export them back into other formats.
You only need to define the relationship between the file and the bean, not the import / export process themselves. Once defined, these models can be used in e.g. action DSL to define actions which, when triggered, execute the actual import / export process, which are generated automatically by the OSBP based on the model.
Overview
The Datainterchange DSL is currently to be found under both packages org.eclipse.osbp.datainterchange.api
and org.osbee.datainterchange
. While all the basic interfaces are defined inside the API package, in the latter one are all commercial features and components implemented such as configuration files and artifacts based on the Smooks framework.
As shown in Figure 3, the [DSL Inferrer] will generate various views and In/Export component according to model described by datainterchange DSL (and action DSL, in the case of ActionButtons). The action buttons, when clicked, will trigger their corresponding In/Export processes by putting WorkerThread (Runnable) jobs into the executor job pool within the TriggerView (prefixed with datainterchang name), buttons (and toolbar / menus containing them) are further included in the perspective.
Data Interchange Grammar Definition
Data Interchange Model File
Datainterchange DSL model files end with the .data
extension. Data Interchange models may be split into several .data
files, as long as they have the same package declaration.
Reserved Keywords
In the following we’ll dive deeper into the description and the usage of Datainterchange related and reserved keywords.
import
In the import section are all entities to be found - as full qualified names – that are currently used in the DSL.
import ns net.osbee.sample.foodmart.entities.Mstore
import ns net.osbee.sample.foodmart.entities.Mwarehouse
import ns net.osbee.sample.foodmart.entities.Mregion
Note that ns
is a mandatory keyword (stands for "namespace") that comes after import for distinguishing the OS.bee internal namespaces and Java library namespaces. Wildcards are not supported, all names should be imported separately. Note that the import section will be imported/added automatically if they are used in the package, so you don't have to manually manage this section.
package
Datainterchange DSL model files must start with a package declaration. Packages are the root element of the DSL and should be defined as <ApplicationName>.datainterchange
, for example:
package net.osbee.sample.foodmart.datainterchanges { }
Data Interchange models may be split into several .data files, as long as they have the same package declaration, the interchanges will be available under this package name.
titel
With the keyword title you can give a name to the corresponding TriggerView dialog inside your application. For example, the definition of the same datainterchanges package from above with title "Data Interchange Example":
package net.osbee.sample.foodmart.datainterchanges title "Data Interchange Example" {}
This title will be translated based on locale.
You can get more details about the TriggerView in the section below.
interchange
The interchange
keyword defines interchange units for entities in the package, it should be in the form of:
interchange <InterchangeUnitName> [describedBy <description>] <EntityManagerMode> file <FileType> [<FileDetails>] beans {
...
}
Where InterchangeUnitName
being the name of the interchange unit; describedBy
(optional) can be used to provide a short description string. EntityManagerMode
being how the file should be handled (see section "persist, merge, remove" below), and after file
keyword, you should specify the type of the source / target data file you would like to import from or export to, then the file path, and other details depends on the type.
The following example specifies an interchange that reads a CSV file under the specified path (note the forward slash as the path separator), delimited by semicolon, skip one line (the header), and treat the content as encoded in UTF-8:
interchange CurrencyNames persist file
CSV "C:/data/ISOCurrencyCode20170101.csv" delimiter ";" skipLines 1 encoding "UTF-8"
beans {
...
}
The beans
keyword comes after all the file specification and starts a block of entity definitions, which will be covered in section "entity".
describedBy
With this keyword you can the optional description of an interchange unit as shown below.
interchange Currencies describedBy "european central bank currency exchange rates based on euro" persist … {
}
persist, merge, remove
These keywords define the purpose of the datainterchange unit and has a similar meaning as in the JPA's EntityManager class. Basically, persist
will insert the data records into database, merge
will update existing data record, or insert new one if necessary, remove
will remove the record if it could be found in database.
Example persist:
interchange EDIOrders describedBy "Orders" persist file
XML "C:/data/orders.xml"
beans {
...
}
Example merge:
interchange Stores merge file
XML "C:/data/stores.xml"
beans {
...
}
Example remove:
interchange Storesremove remove file
XML "C:/data/stores_remove.xml"
beans {
...
}
file
With the keyword file you are able to set the file format of the files you intent to process with you interchange unit.
interchange <SampleInterchangeUnitName> <EntityManagerMode> file <FileNameFormat> {}
The current supported file formats are CSV, EDI and XML followed by the name of the file you want to process, given its full path location in the system.
interchange SampleInterchangeUnit1 merge file CSV "C:/temp/testFile.csv" {}
interchange SampleInterchangeUnit2 persist file XML "C:/temp/testFile.xml" {}
interchange SampleInterchangeUnit3 merge file EDI "C:/temp/testFile.edi" {}
After choosing the file format you can either give the file name as a String value in a double quote "..." as shown here above, or press Ctrl+Space to get via the content assist the option of opening a File Chooser/Picker to specify the file you want to work with.
Please note that you can also change the path(?) of the file to process on runtime by selecting a new file.
elementSize
With the keyword elementSize
followed by an integer, the user can set the estimated average size of the elements in bytes. Since the underlying API can not know the size of an element before it is processed, this value can be supplied as a guide value to be used for estimating the import / output progress based on how much bytes have been processed.
Example:
interchange EDIOrders describedBy "edi orders" persist elementSize 50 file EDI "C:/data/orders.edi" beans {
...
}
Defines an average estimated element size of 50 bytes.
delimiter
The delimiter
is a CSV format-specific keyword, which defines the character to use in the CSV file to set the delimiter/separation character between different values. Default value is "," (comma).
Example:
interchange CurrencyNames persist file
CSV "C:/data/ISOCurrencyCodes081507.csv" delimiter ";" ... {
...
}
will set the delimiter of the CSV file to ";".
skipLines
This is a CSV format-specific keyword. Using skipLines
followed by an integer, the user can specify the number of lines to be skipped from the beginning in the processing of the selected file, this can be used to skip headers.
Example:
interchange CurrencyNames persist file CSV "C:/data/ISOCurrencyCodes081507.csv" ... skipLines 1 encoding "UTF-8" {
...
}
will skip the first line of the CSV file.
report
If a datainterchange unit is defined with the report
keyword, a report will be generated for data conversions. The report file is generated by Smooks, lies under /smooks
output directory of the datainterchange bundle, and has a name in the form like <DataInterChangeName>-input.xml
for input processes, and <DataInterChangeName>-output.xml
for output processes. Note: turn on report will have an impact performance.
indent
The indent
is a CSV-format specified keyword which adds indentation character data to the generated event stream. This simply makes the generated event stream easier to read in its serialized form and generally should only be used in testing.
quoteCharacter
The quoteCharacter
is a CSV format-specific keyword, which defines the character to use in the CSV file to identify values.
encoding
With the keyword enconding followed by the encoding name as a string value you are able to specify the valid encoding of the file content.
Example 1:interchange SampleInterchangeUnitName merge elementSize 50 file CSV "C:/temp/testFile.csv" delimiter ";" skipLines 1 encoding "UTF-8"{}
interchange SampleInterchangeUnitName remove elementSize 50 file CSV "C:/temp/testFile.csv" delimiter ";" skipLines 1 encoding "GB18030"{}
interchange SampleInterchangeUnitName persistelementSize 50 file CSV "C:/temp/testFile.csv" delimiter ";" skipLines 1 encoding "ISO-2022-JP"{}
beans
The keyword beans
starts the series of entities, see example below. The name "bean" comes from the internal entities called JavaBean, which act as data containers.
entity
With the keyword entity the user can specify the entity on which the interchange unit is based on.
Example:
interchange Stores merge elementSize 83 file
XML "C:/data/stores.xml" beans {
entity Mregion
entity Mstore
entity Mwarehouse
}
For more information about the entities, review the corresponding Entity DSL documentation.
Entity Keywords
With the entity
, the user essentially specifies the projection of a source data file to the entities. This projection can be further fine-tuned with the following keywords.
createOn
The createOn
keyword followed by a string specifies on which input element should an entity to be created.
expression -> assign...with...as | copy...from...property
format -> for...coding...locale
keys -> key
mapping -> map...to
lookup -> for...on...createOn...with...cacheSize...mapTo
marker -> makerPath -> markerEntity | markedBy
Enums
Enums are an abstraction of the Java enum
. They compile to enum
classes and can be used as properties in entities and beans.
The Datainterchange DSL provides 3 Enum types that can be used: PredefinedBeanEnum, PredefinedBeanTypeEnum, ProgessBarStyleEnum and the EntityManagerMode.
PredefinedBeanEnum
Contains the definition of the values now(NowDate) | start(StartDate) | UUID(UniversallyUniqueIdentifier)
used for ...
PredefinedBeanTypeEnum
Contains the definition of the values Date(Date) | Millis(Milliseconds) | Nanos(Nanoseconds) | random(Random) | execContext(ExecuteContext)
used for ...
ProgessBarStyleEnum
Contains the definition of the values none(none) | normal(normal) | important(important)
used for setting the style of a progressbar.
EntityManagerMode
Contains the definition of the values persist(persist) | merge(merge) | remove(remove)
used for specifying the Entity Manager Mode in order to decide how to manage data inside a database before committing any changes.
Configuration Settings / Smooks Framework (A small introduction)
The OS.bee implementation of Data Interchange interface is based on Smooks. Smooks is a Java framework for processing XML and non XML data (CSV, EDI, Java etc) by mapping data to JavaBeans, which can later be persisted, enriched (merge with existing data from other source), or converted and exported into other formats.
Smooks relies on a proper configuration file for the import / export processes. These configuration files are generated by the Datainterchange DSL automatically. Here is a brief introduction of how the generated Smooks configuration files work.
When a data import / export Smooks instance is initiated, it will be supplied with the generated config file. This file defines the actions to be performed upon certain events during the SAX parsing process. Here is an example:
1 <?xml version="1.0" encoding="UTF-8" standalone="no"?>
2 <smooks-resource-list xmlns="http://www.milyn.org/xsd/smooks-1.1.xsd" xmlns:csv="http://www.milyn.org/xsd/smooks/csv-1.2.xsd" xmlns:dao="http://www.milyn.org/xsd/smooks/persistence-1.2.xsd" xmlns:jb="http://www.milyn.org/xsd/smooks/javabean-1.2.xsd">
3 <jb:bean beanId="McurrencyNameStream" class="net.osbee.sample.foodmart.entities.McurrencyNameStream" createOnElement="/csv-set">
4 <jb:expression property="importDate">PTIME.nowDate</jb:expression>
5 <jb:wiring beanIdRef="McurrencyName" setterMethod="addToCurrencyNames"/>
6 </jb:bean>
7 <jb:bean beanId="McurrencyName" class="net.osbee.sample.foodmart.entities.McurrencyName" createOnElement="/csv-set/csv-record">
8 <jb:value data="/csv-set/csv-record/isoCode" decoder="String" property="isoCode"/>
9 <jb:value data="/csv-set/csv-record/name" decoder="String" property="name"/>
10 <jb:value data="/csv-set/csv-record/countries" decoder="String" property="countries"/>
11 </jb:bean>
12 <csv:reader fields="isoCode,name,countries" indent="false" separator=";" skipLines="1"/>
13 <params>
14 <param name="stream.filter.type">SAX</param>
15 <param name="inputType">input.csv</param>
16 <param name="smooks.visitors.sort">false</param>
17 <param name="input.csv" type="input.type.actived">C:/git/net.osbee.sample.foodmart/net.osbee.sample.foodmart.datainterchange/smooks-resources/ISOCurrencyCodes081507.csv</param>
18 </params>
19 </smooks-resource-list>
Further informations on the Smoooks framework are available and can be reviewed in its documentation you will find on the official website.
TriggerView & Executorservice
Annotations
Comments
Copyright Notice
All rights are reserved by Compex Systemhaus GmbH. In particular, duplications, translations, microfilming, saving and processing in electronic systems are protected by copyright. Use of this manual is only authorized with the permission of Compex Systemhaus GmbH. Infringements of the law shall be punished in accordance with civil and penal laws. We have taken utmost care in putting together texts and images. Nevertheless, the possibility of errors cannot be completely ruled out. The Figures and information in this manual are only given as approximations unless expressly indicated as binding. Amendments to the manual due to amendments to the standard software remain reserved. Please note that the latest amendments to the manual can be accessed through our helpdesk at any time. The contractually agreed regulations of the licensing and maintenance of the standard software shall apply with regard to liability for any errors in the documentation. Guarantees, particularly guarantees of quality or durability can only be assumed for the manual insofar as its quality or durability are expressly stipulated as guaranteed. If you would like to make a suggestion, the Compex Team would be very pleased to hear from you.
(c) 2016-2024 Compex Systemhaus GmbH