The Datastage EE configuration file is a master control file (a textfile which sits on the server side) for Enterprise Edition jobs which describes the parallel system. 28 Apr The Datastage configuration file is a master control file (a textfile which sits on the server side) for jobs which describes the parallel system. In Datastage, the degree of parallelism, resources being used, etc. are all determined during the run time based entirely on the configuration provided in the APT.

Author: Zujar Kigalmaran
Country: Jordan
Language: English (Spanish)
Genre: Marketing
Published (Last): 20 November 2012
Pages: 166
PDF File Size: 5.39 Mb
ePub File Size: 10.84 Mb
ISBN: 206-8-51812-237-2
Downloads: 27957
Price: Free* [*Free Regsitration Required]
Uploader: Dourisar

This is a 3 node configuration file. If this is the case then this space will be used.

Tutorial is just awesome. I am looking for some good blog sites for studying. The file defines 2 nodes dev1 and dev2 on a single etltools-dev server IP address might be provided as well instead of a hostname with 3 disk resources d1d2 for the data and temp as scratch space. Hence, if the hardware resource is not available to support the maximum parallelization, the performance of overall system goes down. There is a default configuration file available whenever the server is installed.

Pls keep on writing. Greens Technologies In Chennai. There is a default configuration file available whenever the server is installed. So basically in node1 and node2configurtion the resources are shared. As you might know when Datastage creates a dataset, the file you see will not contain the actual data.


This means that the disk and scratch disk specified is actually shared between those two logical nodes. From this we can imply that the nodes node1 and node2 are on the same physical node. Very useful content and also easily understandable providing. Hai you have to learned to lot of information about selenium Gain the knowledge and hands-on experience you need to successfully design,so you have more details visit this site.

If your underlying system should have the capability to handle these loads then you will be having a very inefficient configuration on your hands. It is possible that conductor node is not connected with the high-speed network switches.

NET but dont know indepth. Pools — Pools allow us to associate different processing nodes based on their functions and characteristics. This is an awesome post. So basically in node1 and node2all the resources are shared. Inspiring writings and I greatly admired what you have to sayI hope you continue to provide new ideas for us all and greetings success always for you. How datastage decides on which processing node a stage should be run?

However, like main configuration file, we can also have many startup configuration files. This script has a default name of startup.

I have learned a lot of new things from your blog. Wedding Makeup Artist in jaipur. Basically the configuration file contains the different processing nodes and also specifies the disk space provided for each processing node.


This is a 3 node configuration file.

What is expiry date? Core java training In Chennai. I found your blog while searching for the updates, I am happy to be here.

Understanding the datastage configuration file – ETL and Data Warehouse links

Thank you for benefiting from time to focus on this kind of, I feel firmly about it and also really like comprehending far more with this particular subject matter. How do you configure your system so datzstage you will be able to achieve optimized parallelism?

The resource keyword is configuuration by the type of resource that a given resource is restricted to, for instance resource disk, configkration scratchdisk, resource sort, resource bigdata.

Now if you look at node3 can see that this node is associated to the sort pool. What are the different options a logical node can have in the configuration file? Datastage EE configuration file defines number of nodes, assigns resources to each node and provides advanced resource optimizations and configuration.