When To Use A Gaussian Mixture Model

Computer Science Level 2

Which of the following data sets is most likely to be well-modeled by a Gaussian mixture model ?

Numbers of pregnancies across many humans Speeds across cars just before reaching a specific traffic light Student scores on a specific standardized test

5 solutions

Gabe Smith
Aug 4, 2017

Student scores on a standardized test are likely to be normally distributed, but there is no reason to think that the distribution would not be unimodal without more information. Here's an example of SAT scores.

Numbers of pregnancies across humans is tempting, but probably not a good fit. There are two reasons: if we don't separate males/females, then the distribution is simply not going to be multimodal. If we do separate males/females, we have two issues: (a) the females don't actually follow a normal distribution - it often looks something like this , and (b) this model will do a poor job of distinguishing all of the males with 0 and the sizable number of females with 0. If anyone has ideas for other cohorts to use to fit a GMM to pregnancies, post a comment!

Speeds are a great fit, though! When the light is green (and some yellow times), the cars will be going around the speed limit. When the light is red (and some yellow times), the cars will be close to a stopping speed (0). A GMM with two Gaussian distributions will fit this data reasonably well.

If we separate nationality in numbers of pregnancies across humans, would it possibly be multimodal?

Fucai Zhu - 3 years, 3 months ago

I also have the same question?

Rahul Singh - 3 years, 1 month ago

For pregnancies, I get that if we don't separate males and females, then it won't be like a typical normal distribution. Most of the mass will be centered at 0 covering men and the amount of women who don't have children. But there will still probably be other modes, like at 1 and 2 for developed countries and more developing countries right?

John Chen - 2 years, 9 months ago

I think the number of pregnancies would be multimodal if we would separate it at least by continents (Africa vs. Europe are different stories for example).

Vojta Paukner - 1 year, 1 month ago

That's what I thought!

Lucy Rothwell - 7 months, 2 weeks ago

Man Bai
Jan 3, 2018

students scores and Numbers of pregnancies, i think they are normally distributed

I think so.

WU maggie - 10 months, 3 weeks ago

John Phelan
Jul 17, 2020

A traffic light can be in three distinct states as cars approach it (green, yellow red). Therefore there will be a distribution of speeds of cars approaching when green, another when yellow and another when red.

Ruiguo Zhu
Apr 19, 2019

The number of pregnancies is discrete values. it is likely a Poisson distribution.

Student score will be normally distributed without Gaussian mixture model

伦金
Dec 1, 2019

students: T distribution pregnancies: Normal distribution Speed: Decide by different role people may play.

When To Use A Gaussian Mixture Model

5 solutions

0 pending reports