Skip to main content

Blog

Learn About Our Meetup

5000+ Members

MEETUPS

LEARN, CONNECT, SHARE

Join our meetup, learn, connect, share, and get to know your Toronto AI community. 

JOB POSTINGS

INDEED POSTINGS

Browse through the latest deep learning, ai, machine learning postings from Indeed for the GTA.

CONTACT

CONNECT WITH US

Are you looking to sponsor space, be a speaker, or volunteer, feel free to give us a shout.

Author: torontoai

[D] AI clones voice from 5 second recording

TwoMinutePapers clipResearch paper

Published in 2018, and I’ve yet to see any fuss made over this – as opposed to image face recognition. Isn’t it a big deal? Evidence fabrication can be taken to a whole new level – in criminal, political, personal, and other domains.

On another note, combined with GPT2, and some of the face simulation methods, shouldn’t one be able to make a fairly convincing conversational AI? If GPT2 can somehow be configured to turn off lecture-mode and shorten responses, it might be fully convincing to those who don’t look much into AI. (Though I question its ability to reply based on previous replies, which requires memory.)

submitted by /u/OverLordGoldDragon
[link] [comments]

[D] History of NLP: How Andrey Markov and Claude Shannon Made Language Models for Text Generation

These were painstaking procedures. Markov counted vowels and consonants in a Russian novel to make probability rules for which letter would appear next. 30 years later, Shannon took the idea a step further by creating statistical models of language and generating text according to the rules he devised. Shannon’s first output was “OCRO HLI RGWR,” but his model did get a bit better.

https://spectrum.ieee.org/tech-talk/robotics/artificial-intelligence/andrey-markov-and-claude-shannon-built-the-first-language-generation-models

submitted by /u/newsbeagle
[link] [comments]

[P] Traffic Analysis in Original Video Data

[P] Traffic Analysis in Original Video Data

Half a year ago, when I lived in an apartment with a view of Ayalon Road in Tel Aviv, I decided I couldn’t just let the data pass indifferently beneath my window.

I used my smartphone camera to record 81 short videos of the traffic; built a dedicated CNN to detect the vehicles (after the small, crowded cars in the videos were failed to be detected by several out-of-the-box networks) and trained it on a few manually-tagged video-frames; modified SORT algorithm to allow tracking a vehicle even when it does not overlap itself over adjacent frames (required due to particularly low videos frame-rate); and derived several insights from the resulted data, mainly regarding lane-transitions and the relations between density, speed and flux.

I believe that this nicely demonstrates the amounts of data surrounding us, and their accessibility using as trivial tools as a smartphone.

Any comments, questions and insights are welcome 🙂

A small demonstration is attached in the form of an unpolished poster.

For more details please visit the repo’s readme:

https://github.com/ido90/AyalonRoad

https://preview.redd.it/zrboq3n36ay31.png?width=1539&format=png&auto=webp&s=5bb8d99159046b75eeeee3d50563232563bcebb4

submitted by /u/ido90
[link] [comments]

First Time’s the Charm: Sydney Startup Uses AI to Improve IVF Success Rate

In vitro fertilization, a common treatment for infertility, is a lengthy undertaking for prospective parents, involving ultrasounds, blood tests and injections of fertility medications. If the process doesn’t end up in a successful pregnancy — which is often the case — it can be a major emotional and financial blow.

Sydney-based healthcare startup Harrison.ai is using deep learning to improve the odds of success for thousands of IVF patients. Its AI model, IVY, is used by Virtus Health, a global provider of assisted reproductive services, to help doctors evaluate which embryo candidate has the best chance of implantation into the patient.

Founded by brothers Aengus and Dimitry Tran in 2017, Harrison.ai builds customized predictive algorithms that integrate into existing clinical workflows to inform critical healthcare decisions and improve patient outcomes.

Ten or more eggs can be harvested from a patient during a single cycle of IVF. The embryos are incubated in the lab for five days before the most promising candidate (or candidates) are implanted into the patient’s uterus. Yet, the success rate of implantation for five-day embryos is under 50 percent, and closer to 25 percent for women over the age of 40, according to the U.S. Centers for Disease Control and Prevention.

“In the past, people used to have to implant three or four embryos and hope one works,” said Aengus Tran, cofounder and medical AI director of Harrison.ai, a member of the NVIDIA Inception virtual accelerator program, which offers go-to-market support, expertise, and technology for AI startups revolutionizing industries. “But sometimes that works a little too well and patients end up with twins or triplets. It sounds cute, but it can be a dangerous pregnancy.”

Built using NVIDIA V100 Tensor Core GPUs on premises and in the cloud, IVY processes time-lapse video of fertilized eggs developing in the lab, predicting which are most likely to result in a positive outcome.

The goal: a single embryo transfer that leads to a single successful pregnancy.

Going Frame by Frame

Embryologists manually analyze time-lapse videos of embryo growth to pick the highest-quality candidates. It’s a subjective process, with no universal grading system and low agreement between experts. And with five days of footage for every embryo, it’s nearly impossible for doctors to look at every frame.

Harrison.ai’s IVY deep learning model analyzes the full five-day video feed from an embryoscope, helping it surpass the performance of AI tools that provide insights based on still images.

“Most of the visual AI tools we see these days are image recognition,” said Aengus. “But with an early multi-cell embryo, the development process matters a lot more than how it looks at the end of five days. The critical event could have happened days before, and the damage already done.”

The company trained its deep learning models on a dataset from Virtus Health including more than 10,000 human embryos from eight IVF labs across four countries. Instead of annotating each video with detailed morphological features of the embryos, the team classified each embryo with a single label: positive or negative outcome. A positive outcome meant that a patient’s six-week ultrasound showed a fetus with a heartbeat — a key predictor of successful live births.

In a recent study, IVY was able to predict which embryos would develop a heartbeat with 93 percent accuracy. Aengus and Dimitry say the tool could help standardize embryo selection by reducing disagreement among human readers.

To keep up with Harrison.ai’s growing training datasets, the team upgraded their GPU clusters from four GeForce cards to the NVIDIA DGX Station, the world’s fastest workstation for deep learning. Training on the Tensor Core GPUs allowed them to leverage mixed-precision computing, shrinking their training time by 4x.

“It’s almost unreal to have that much power at your fingertips,” Aengus said. Using the DGX Station, Harrison.ai was able to boost productivity and improve their deep learning models by training with bigger datasets.

The company uses the deskside DGX Station for experimentation, research and development. For training their biggest datasets, they scale up to larger clusters of NVIDIA V100 GPUs in Amazon EC2 P3 cloud instances — relying on NGC containers to seamlessly shift their workflows from on-premises systems to the cloud.

IVY has been used in thousands of cases in Virtus Health clinics so far. Harrison.ai is also collaborating with Vitrolife, a major embryoscope manufacturer, to more smoothly integrate its neural networks into the clinical workflow.

While Harrison.ai’s first project is for IVF, the company is also developing tools for other healthcare applications.

The post First Time’s the Charm: Sydney Startup Uses AI to Improve IVF Success Rate appeared first on The Official NVIDIA Blog.

[P] Refit existing Spark ML PipelineModel with new data

Hi!

I’d like to refit an alerady existing PipeLineModel in my project from microbatch to microbatch.

I curently use a DecisionTreeRegressor. I load back the previusly used PipelineModel, set it’s stages to a pipeline and use that pipeline to refit with new data, but as I can understand my solution only saves the latest model.

I enclose my github repo/model for the easier understanding.

Does Spark Structured Streaming capable of streaming learning? Is it possible to refit my already fitted model with new data?

Do I have to use the RDD based Spark Streaming with StreamingLinearRegressionWithSGD?

submitted by /u/Hakuhun
[link] [comments]

[P] Deploy A Text Generating API With Hugging Face’s DistilGPT-2

https://towardsdatascience.com/deploy-a-text-generating-api-with-hugging-faces-distilgpt-2-9791b9f356f9

A step-by-step tutorial for deploying Hugging Face’s NLP models as web APIs on AWS. Uses open source tools and covers many more advanced infrastructure challenges (autoscaling, prediction monitoring, rolling updates, etc.)

submitted by /u/KindaKnowKarate
[link] [comments]