Skip to main content


Learn About Our Meetup

5000+ Members



Join our meetup, learn, connect, share, and get to know your Toronto AI community. 



Browse through the latest deep learning, ai, machine learning postings from Indeed for the GTA.



Are you looking to sponsor space, be a speaker, or volunteer, feel free to give us a shout.

[D] How do you use unsupervised Learning methods with time-series data?

I have a question about a problem that I am trying to solve.

I have clinical data (time-series measurements), and I aim to understand patients’ problems. Every measurement is reporting data in slightly different way depends on the behavior of the patient / equipement used to monitor patient.

This later presents three challenges:

1/ missing data for some measurements for some time.

2/ normalization problem. we don’t know have a clear idea on min/max of medical values (I assume it is hardly predictable in some cases).

3/ Since labeling such data is very costly. I can get some labeled data but it would be really a small subset.

What do I have?

For the sake of an example, let’s say that I have three measurements (measurement A, measurement B, measurement C).

I have time series of measurement A, B, C for healthy patients (they recovered and they are staying in hospital for few days), and I have time series of measurement A, B, C for patients who struggle with some problems.

I only know that information. The idea is to categorize patient problems over time and use it in other places where some specialized doctors lack expertise to identify problems. How can I approach this?

A t1, t2,t3,<missing>,t5,t6
B t1, t2,t3,t4,t5,t6
C t1, t2,t3,X,t4,<missing>,t6

If I see these time series, I would say that it is patient is struggling with problem X

P.S: I have > hundered measurements.

Suggested approach

Since the three measurements don’t report data in the same time window, I averaged on time window T. I focused only on time series of sick patients. I tried a naive approach of apply clustering with temporal constraints. Since it;s a naive approach to the problem, I started looking/exploring other methods.

Questions: 1/ How can I leverage measurements of healthy patients (use it as a guide) and the little labeled data I have 2/ what are some of the methods that I can use for unsupervised learning to tag/cluster problems (doctor will later identify them)?

I am seeking advises/recommendations on methods to explore. Do you have any suggestions, ideas and papers to explore. I would be thankful.

submitted by /u/__Julia
[link] [comments]