Loading…
DeveloperWeek 2014 Conference & Expo has ended
Tuesday, February 18 • 10:00am - 10:40am
TALK: Pattern: An Open Source Project for Creating Complex Machine Learning Applications

Sign up or log in to save this to your schedule, view media, leave feedback and see who's attending!

Cascading Pattern is an open source project that takes models trained in popular analytics frameworks, such as SAS, Microstrategy, SQL Server, etc., and runs them at scale on Apache Hadoop. With Pattern, developers can use a Java API to create complex machine learning applications, such as recommenders or fraud detection. Pattern effectively lowers the barrier of adoption to Apache Hadoop for developers because developers can use existing skill sets to immediately begin building these complex applications.

This machine-learning library works by translating PMML – an established XML standard for predictive model markup – into data workflows based on the Cascading API in Java. PMML models can be run in a pre-defined JAR file with no coding required. PMML can also be combined with other flows based on ANSI SQL (Cascading Lingual), Scala (Scalding), Clojure (Cascalog), etc. Multiple companies have collaborated to implement parallelized algorithms: Random Forest, Logistic Regression, K-Means, Hierarchical Clustering, etc. Benefits include greatly reduced development costs and less licensing issues at scale – while leveraging a combination of Apache Hadoop clusters, existing intellectual property in predictive models, and the core competencies of analytics staff.

In this presentation, Concurrent, Inc.’s Alexis Roos, will provide sample code that will show applications using predictive models built in SAS and R, such as anti-fraud classifiers. Additionally, Alexis will compare variations of models for enterprise-class customer experiments.

Speakers
avatar for Alexis Roos

Alexis Roos

Senior Solutions Architect, Concurrent, Inc.
Alexis Roos is a Senior Solutions Architect focusing on Big Data solutions at Concurrent, Inc. He has over 18 years of experience in software and sales engineering, helping both Fortune 500 firms and start-ups build new products that leverage Big Data, application infrastructure... Read More →


Tuesday February 18, 2014 10:00am - 10:40am PST
Terra Gallery & Event Venue 511 Harrison St, San Francisco, CA 94105

Attendees (0)