Author: torontoai

Success rate/scoring of categorical features as features? [Discussion]

Written on August 20, 2019. Posted in Reddit MachineLearning.

Hi all,

Let’s say I have a dataset with a mix of continuous and categorical variables and I’m creating an imbalanced binary classification model. A certain categorical variable has many (1000’s) of non-ordinal values. This feature is very important, certain values have high success rates ( num success(num rows with value and flag of 1)) / num instances (all rows with value) ). I have created features that include cat_var_num_success, cat_var_ success_rate, and a feature that is a score of each value of the categorical feature. The score assigns the mean overall success rate as the score if the value low sampling, if the value has a sufficient number of observations, and the success rate is greater than the mean overall success rate, the score is raised, the score is lowered if the value performs worse than the overall mean.

These generated features have proven to be highly predictive and improve the model (xgb). My concern comes from the fact that I have calculated these values using the whole dataset, which I subsequently split into train-test. I am afraid that the performance of the model is increasing due to information testing information leaking into training via the generated features.

Should I create an additional holdout set which does not contribute to the calculations for feature generation and test on that?

Thoughts?

Any feedback is appreciated!

submitted by /u/JohnnyCaggz
[link] [comments]

[P] Text classification w/ pytorch-transformers using RoBERTa

Written on August 20, 2019. Posted in Reddit MachineLearning.

Hi I just published a blog post on how to train a text classifier using pytorch-transformers using the latest RoBERTa model. Colab notebook is available: https://rsilveira79.github.io/fermenting_gradients/machine_learning/nlp/pytorch/text_classification_roberta/

submitted by /u/rsilveira79
[link] [comments]

[D] The state of transfer learning in NLP

Written on August 20, 2019. Posted in Reddit MachineLearning.

http://ruder.io/state-of-transfer-learning-in-nlp/

This blog post by Sebastian Ruder is a quick review of how natural language processing has benefited from transfer learning. He ties together how recent advances (e.g., pretrained models/BERT, optimization schemes, multitask fine-tuning, etc) can work together to improve language modeling, and also poses some open problems in the field. See also the (somewhat empty) HN discussion.

submitted by /u/jwuphysics
[link] [comments]

[Research] Help us record data for research on NLP from Radio Data for Agriculture

Written on August 20, 2019. Posted in Reddit MachineLearning.

Hello lovely inhabitants of r/MachineLearning. I am doing a residency at Artificial Intelligence and Data Science Research Lab, Makerere University, Uganda. We are collecting speech data for use in analysing radio programs related to agriculture, we will use this data to help in mapping and understanding the spread of crop disease.

Help us out by taking some time and recording some words.

Go here to help us record or there to view our work.

submitted by /u/ghost_shaba7
[link] [comments]

[D] Looking for opinions on using 3d models for image recognition.

Written on August 20, 2019. Posted in Reddit MachineLearning.

I was thinking about why the training process for image recognition is such a resource intensive task, and I got caught up on the idea that information loss caused by using projections (i.e. pictures) as training data would be an interesting area to explore.

In comes mesh based 3d models. The training subject now has all of the necessary features in a single example My question is twofold. Do you see any potential in using 3d models as a source for an ML algo/NN, and if yes, how would you go about doing it?

I’ve done a bit of brain storming, but I can’t really get anywhere. My initial thoughts were decompose the problem into parts, and solve it from both ends. One challenge is getting an isolated object from an image to use as a target, and the other is decomposing a 3d model into a useful 2d projection that can be compared with the target.

Figuring that the 2D part would be easier, I decided to tackle that first, and I found the task of meaningful image segmentation to be a bit more difficult than I expected. It essentially came down to this: I can use pre existing methods to segment an image, or I can make my own; but nothing actually does decent job of pulling out objects with the exception of NNs which need to be trained to do so. In an attempt to improve the training process, I essentially run into the need to use methods that I’m trying to replace.

The 3D part has a variety of challenges as well. I figured I could do something like take an image of the model that’s in similar proportion to the image that contains my target, then try to align the centroid of the model with the centroid of the target. This would eventually get to needing to iteratively translate, rotate, and scale the model. Some other obvious issues include segmenting the 3d model, posing the model, and finding what model to use as a reference when there are multiple ones saved.

Thoughts?

submitted by /u/seek_freedom
[link] [comments]

What’s Up, Doc?: AI Startup Gives Patients the Power

Written on August 20, 2019. Posted in NVIDIA.

AI, when applied to healthcare, holds the promise of better medical predictions and faster medicinal improvements. To do so, it needs data on which to train. But healthcare data is sensitive and private, creating a dead end.

Or not. Walter De Brouwer, CEO of Silicon Valley startup doc.ai, joined AI Podcast host Noah Kravitz to discuss how his medical research-based platform makes the application of deep learning to healthcare possible.

Many consumers worry about the outcome of putting their data in the cloud, where it risks being pirated. And larger institutions like hospitals that have an abundance of healthcare data are reticent to share it because it could reveal other sensitive business information.

Doc.ai first collects everything it needs from the device and user input. It takes into account data from Apple Health, blood tests, urine analysis, and any other medical information uploaded by the client.

The platform has eight prediction models, which exist locally on the client’s device. Each module has a specific focus, from the general medical record, to urine sampling, to phenomics.

These models transform the data into tensors — what De Brouwer calls “a big pile of numbers.” They’re then uploaded to the cloud with no risk of being pirated, because without the model that produced them, they’re like “GPS coordinates on planet Jupiter – you can’t do anything with it.”

From there, doc.ai can use these tensors to improve its deep learning algorithms and improve medical predictions.

Doc.ai also provides a platform to run medical trials. Clients and doctors can create their own, or take part in the three that De Brouwer has already organized. The first is organized in collaboration with advisors from Harvard Medical School, and focuses on allergy triggers. Another, studying the various combinations of the 26 medicines designed to treat epilepsy, was designed alongside Stanford specialists.

De Brouwer is no stranger to entrepreneurship. He’s the cofounder of several AI-based companies, including Inui Health, which performs urine analysis using machine learning and mobile phones, and XY.ai, a spinoff of Harvard Medical School that uses AI for large-scale digital twin technology.

De Brouwer is certain that doc.ai has found the ideal approach for improving healthcare knowledge. “This is Darwinism,” he says. “First you collect, then you predict, and then you change the bad things and amplify the good things. These three steps are basically the evolution algorithm of the planet.”

To learn more about doc.ai, visit their website here. Or visit their blog for weekly stories, as well as Github commands and Jupyter notebooks.

Help Make the AI Podcast Better

Have a few minutes to spare? Fill out this short listener survey. Your answers will help us make a better podcast.

How to Tune in to the AI Podcast

Get the AI Podcast through iTunes, Castbox, DoggCatcher, Overcast, PlayerFM, Podbay, PodBean, Pocket Casts, PodCruncher, PodKicker, Stitcher, Soundcloud and TuneIn. Your favorite not listed here? Email us at aipodcast [at] nvidia [dot] com.

The post What’s Up, Doc?: AI Startup Gives Patients the Power appeared first on The Official NVIDIA Blog.

[P] OpenCL framework with Dense layers

Written on August 20, 2019. Posted in Reddit MachineLearning.

Hi everyone,

I made simple DL framework on OpenCL just with Dense layers and several activations and losses: https://github.com/Airplaneless/Hallgerd

Matrix multiplication performance

MLP performance

Syntax is similar to Keras:

In [1]: from hallgerd.core import Sequential gpu = Device(devices['GeForce GTX 660'], DTYPE=np.float32) model = Sequential(device=gpu, lr=1e-1, batch_size=1024, epochs=40, loss='cross_entropy') In [2]: model.add(Dense(200, 200, activation='relu')) model.add(Dense(200, 5, activation='softmax')) In [3]: model.fit(X, y) # here X.shape = (feature size, dataset size) # y.shape = (output size, dataset size) In [4]: yp = model(X)

I think performance is quite sufficient on devices without PyTorch or Tensorflow support.

https://i.redd.it/4m3g6kfcmsh31.jpg

submitted by /u/airplanelesss
[link] [comments]

[P] AmpliGraph 1.1.0: user-friendly knowledge graph embeddings library with state-of-the-art results

Written on August 20, 2019. Posted in Reddit MachineLearning.

We have just released a new version of AmpliGraph. It is available from pypi, all you have to do is run pip install –upgrade ampligraph. It requires TensorFlow 1.13 or 1.14 to be installed.

For beginners, we have two new tutorials: * Basics with Game of Thrones data * Clustering and classification with football data

For advanced users, we have a new knowledge discovery API, including discover facts, clustering, near-duplicates detection, and topn query.

We also provide state-of-the-art results on two benchmark datasets (WN18RR and YAGO3-10), plus competitive results on other benchmarks. These results are fully reproducible.

The updated documentation with the complete changelog is available here: http://docs.ampligraph.org

If you have any questions or feedback, you can comment here or raise an issue on GitHub.

submitted by /u/tabacof
[link] [comments]

[D] Speech Pronunciation Recognition (and Feedback for pronunciation coaching)

Written on August 20, 2019. Posted in Reddit MachineLearning.

Hi all,

I was checking out this video about an ‘AI-enabled speech pronunciation coaching app’ where it records your voice and gives feedback on your (English) pronunciation. With details on which part of the word needs more practice. Please watch a snippet of the video demonstrating it here.

Can someone recommend me papers/repositories explaining the algorithms and method that (most likely) enables the app to do exactly this?

My take on it will be using ASR (Automatic Speech Recognition) to do speech to phonemes and phonemes to a special grammar (e.g. ARPABET). Then the recognised speech, in the special grammar, is compared to the truth. If it matches, then your pronunciation is perfect. Otherwise, it will give you feedback on the specific part of the word that needs practice.

Thanks!

submitted by /u/VividFee
[link] [comments]

[D] Radeon RX Vega M GL for machine learning (on a Intel NUC 8 mini PC)

Written on August 20, 2019. Posted in Reddit MachineLearning.

I’ve been seeing the Intel NUC 8 mini PCs going on sale. Specifically the NUC8i7HNK2 with an i7-8705G 3.10 Ghz, Kaby Lake G processor and a Radeon RX Vega M GL is on sale for $700 (pretax) without the RAM or SSD.

This is a great value! I need a portable machine learning teaching tool. A big bulky gaming GPU laptop is not that great (unwieldy, price and performance wise). These Intel NUCs you can just throw in the luggage and plug it in anywhere to do a demo or try out an idea before I go out and rent a cloud GPU to do the heavy lifting.

But is that RX Vega M GPU any good for machine learning?

I know last year they didn’t support CUDA but is this still an issue with libraries? I tried digging into documentation but it’s unclear.

submitted by /u/Jigsus
[link] [comments]

Blog

Learn About Our Meetup

5000+ Members

MEETUPS

JOB POSTINGS

CONTACT

Author: torontoai

Success rate/scoring of categorical features as features? [Discussion]

[P] Text classification w/ pytorch-transformers using RoBERTa

[D] The state of transfer learning in NLP

[Research] Help us record data for research on NLP from Radio Data for Agriculture

[D] Looking for opinions on using 3d models for image recognition.

What’s Up, Doc?: AI Startup Gives Patients the Power

Help Make the AI Podcast Better

How to Tune in to the AI Podcast

[P] OpenCL framework with Dense layers

[P] AmpliGraph 1.1.0: user-friendly knowledge graph embeddings library with state-of-the-art results

[D] Speech Pronunciation Recognition (and Feedback for pronunciation coaching)

[D] Radeon RX Vega M GL for machine learning (on a Intel NUC 8 mini PC)