DJ Random Forest Example

The DJ Random Forest algorithm is super easy to setup.

STEP 1: If you haven't already downloaded Data Juggernaut's SSIS custom Random Forest component you can do so now by clicking here. Installation and troubleshooting instructions can be found here.

STEP 2:Open SSIS and drag the RandomForestTask on to a new or existing SSIS package. SSIS Custom Component DJRandomForest

STEP 3:Update the Random Forest Task properties. For the IgnoreFeatures property, you must surround each feature with single quotes. Also, with the IgnoreFeatures property, you need to comma seperate each string and enclose the entire string with brackets. If you don't, the program will fail. The installation comes with an example data directory that includes both training and testing datasets. These datasets can also be found on Kaggle. You can also submit your results to compare your score with the Kaggle community. SSIS Custom Component DJRandomForest

STEP 4:Verify that you have set the file location properties correctly for all of the files.

STEP 5:That's it! If you want to get fancy, you can import the results into a database.

Have questions about this tutorial or a dataset you're trying to predict? We'd love to hear from you! Email us at DataJuggernaut@gmail.com