The architecture of the Analyze Desktop product is very different from that of the LAE Desktop product.
When you install Analyze Desktop, the application footprint consists of a Tomcat web application server, the Analyze Server, and an H2 file-based database. LAE Desktop, by contrast, consists of only the BRE thick client and the LAE Server.
As a result, Analyze generally requires and uses more system resources than LAE Desktop.
*** What can I do to improve the overall performance of D360? ***
In terms of execution, a lot of work has been done in Analyze to improve performance: we have rewritten many of the nodes, and the new versions supersede the old ones.
For example:
- Agg Ex - renamed to "Agg Ex (Superseded)" and replaced by Aggregate
- Filter - renamed to "Filter (Superseded)" and replaced by Filter for basic filtering, and by Transform for scripted transformations (now written in Python rather than Brainscript)
- All the Join nodes have been superseded and replaced with new nodes
- Sort - renamed to "Sort (Superseded)" and replaced by Sort.
Therefore a Data Flow in Analyze using the new nodes should perform better than the equivalent Graph using the old nodes in BRE.
It would be interesting to see your results as you start to migrate your Graphs from .brg files into Analyze and start swapping out the old nodes for the new ones.
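As you migrate, it can help to keep a checklist of which superseded nodes are still present in a Data Flow. The sketch below is only an illustration in plain Python, not an Analyze feature: the helper name `suggest_replacements` is hypothetical, and it assumes you have collected the node labels yourself as a simple list. The mapping covers only the nodes named above.

```python
# Hypothetical helper, not part of Analyze. Maps the superseded node labels
# listed above to their replacement nodes.
SUPERSEDED = {
    "Agg Ex (Superseded)": ["Aggregate"],
    "Filter (Superseded)": ["Filter", "Transform"],
    "Sort (Superseded)": ["Sort"],
}


def suggest_replacements(node_labels):
    """Return {label: [replacement nodes]} for labels that look superseded."""
    suggestions = {}
    for label in node_labels:
        if label in SUPERSEDED:
            suggestions[label] = SUPERSEDED[label]
        elif label.endswith("(Superseded)"):
            # Superseded, but not in the short list above (for example a
            # Join node), so no replacement is suggested here.
            suggestions[label] = []
    return suggestions


# "Some Join (Superseded)" is a made-up label used only for illustration.
labels = ["Agg Ex (Superseded)", "Sort", "Some Join (Superseded)"]
for label, replacements in suggest_replacements(labels).items():
    print(label, "->", replacements or "check the node library")
```

Labels that are already on the new nodes (such as the plain "Sort" above) are simply skipped.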
*** Is there an option similar to the LAE "Run in aggressive mode" within D360 to help improve processing speed? ***
300 million records is a lot to process on a desktop, especially in a larger Data Flow. That said, the answer to this is currently "no". For ad-hoc runs in Analyze, where you go in and run nodes interactively, interim data is always written to file and is only cleaned up when you re-run or clear the nodes.
There is a general setting that determines how long ad-hoc runs are kept, although it can only be set as a number of days.
For scheduled runs, there are settings for when to delete temporary data, but again these apply to completed runs rather than to a run that is still in progress.
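To make the distinction concrete, here is a minimal Python sketch of an age-based cleanup rule of the kind described above. It is not how Analyze implements its settings: the function name, the `(name, finished_at)` run records and the `keep_days` parameter are all assumptions made for illustration. The point it shows is that a run still in progress is never selected, however much interim data it has written.

```python
from datetime import datetime, timedelta


def runs_eligible_for_cleanup(runs, keep_days, now=None):
    """Return the names of completed runs that finished more than keep_days ago.

    runs: list of (name, finished_at) tuples. finished_at is None while a run
    is still executing, so in-progress runs are never selected.
    """
    now = now or datetime.now()
    cutoff = now - timedelta(days=keep_days)
    return [
        name
        for name, finished_at in runs
        if finished_at is not None and finished_at < cutoff
    ]


# Illustrative data only.
now = datetime(2024, 1, 10)
runs = [
    ("run-a", datetime(2024, 1, 1)),  # finished 9 days ago
    ("run-b", datetime(2024, 1, 9)),  # finished 1 day ago
    ("run-c", None),                  # still running
]
print(runs_eligible_for_cleanup(runs, keep_days=7, now=now))  # ['run-a']
```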
- Starting up D360 Analyze
Due to the difference in the product architecture, this will generally be slower than in LAE Desktop.
- Browsing and toggling through the Directory User Interface
This is something we know about, and will be improving in a future release.
- Opening a dataflow / Browsing and toggling through a dataflow (e.g. entering and exiting composite nodes) / Connecting nodes and/or bending connectors / Toggling through the Properties panel
These are things that we know about and are currently working on. You should see major improvements here once we move into the 3.8 series of Analyze releases.
- Opening data in DataViewer
This is something we know about; larger and wider datasets will generally take longer to load. Again, it is something we will be looking at in the future.
In summary…
- Execution: should be faster than in BRE if you replace the superseded / deprecated nodes.
- Design-time performance issues: we know about these, are actively working on them, and expect major improvements in coming releases.
- Resource usage: due to the architectural changes, overall resource usage will be higher than in LAE.