Skip to main content

Blog

Learn About Our Meetup

5000+ Members

GO >

MEETUPS

LEARN, CONNECT, SHARE

Join our meetup, learn, connect, share, and get to know your Toronto AI community.

JOIN

JOB POSTINGS

INDEED POSTINGS

Browse through the latest deep learning, ai, machine learning postings from Indeed for the GTA.

JOBS

CONTACT

CONNECT WITH US

Are you looking to sponsor space, be a speaker, or volunteer, feel free to give us a shout.

CONTACT

[P] Training Random Forest with a single vector (for each obs) in h2o?

I’m starting to use h2o to train and serve models. I have a dataset that I’d already curated for Spark ML pipelines. I have a single 16D vector I pass as the training data for each observation.

A friend said that h2o requires columns for each category and treats my single vector as a string, which I just can’t find anything to support. The accuracy is around what I got out of Spark ML, but I’m worried about how h2o is handling my data. Does anyone know how h2o handles this case?

tl;dr – Can I use a single vector for each training observation in h2o?

submitted by /u/Octosaurus
[link] [comments]