Random Forest in Data Analytics

From MIT Technology Roadmapping
Jump to navigation Jump to search

Technology Roadmap Sections and Deliverables

Unique identifier:

  • 3RF-Random Forest

This is a “level 3” roadmap at the technology/capability level (see Fig. 8-5), where “level 1” would indicate a market level roadmap and “level 2” would indicate a product/service level technology roadmap.

Roadmap Overview

The high level workflow is depicted in the below.

Section 1.JPG

Random forest is an ensemble Machine Learning technique to boost the accuracy of prediction of future, based on data from the past. “Wisdom of crowd” Building block is decision tree; a voting scheme is used to determine the final prediction Commonly used ensemble approaches are booting, bagging, and stacking

Design Structure Matrix (DSM) Allocation

Section 2.JPG

The 3-RF tree that we can extract from the DSM above shows us that the Random Forest(3RF) is part of a larger data analysis service initiative on Machine Learning (ML), and Machine Learning is also part of a major marketing initiative (here we use online advertising as an example). Random Forest requires the following key enabling technologies at the subsystem level: Bagging (4BAG), Stacking (4STK), and Boosting (4BST). These three are the most common approaches in Random Forest, and are the technologies and resources at level 4.

Roadmap Model using OPM

Figures of Merit

Alignment with Company Strategic Drivers

Positioning of Company vs. Competition

Technical Model

Financial Model

List of R&T Projects and Prototypes

Key Publications, Presentations and Patents

Technology Strategy Statement