Reddit » Statistics
343 FOLLOWERS
This is a subreddit for the discussion of statistical theory, software and application.
Reddit » Statistics
34m ago
Hey I need help choosing between Cal Poly Masters in Science Statistics or CSU Long Beach Masters in Science Applied Statistics. I'm going to list My own personal Pros and Cons of each. If anyone has or is apart of these programs or knows someone whose graduated please let me know what you think.
Cal Poly MS Statistics:
Pros:
Good Department (all Professors I've talked to from my undergrad have given thumbs up for this department)
Good core sequence offered
name recognition
I know a friend that lives in SLO
Campus is really nice
Thesis
consulting classes are part of the core curriculum.
Con ..read more
Reddit » Statistics
34m ago
I am working with this dataset- https://www.fema.gov/about/openfema/data-sets/national-household-survey 2023 fema national household survey.
I converted each column to a yes or no question duplicating columns as needed on the advice of my PI. This took about eight weeks. It also has caused some multicollinearity concerns with my data.
1- If I had not converted each data to a yes or no question would I still have multicollinearity concerns? 2- With the way the data was what test would be the best way to figure out the hypothesis of ethnic minority lower income and likelihood of disaster. I was ..read more
Reddit » Statistics
6h ago
A researcher wants to compare the performance of four learning techniques
on multiple data sets (five) using the performance measure, area under the ROC
curve. The data for the scenario is given below. Determine whether there is any
statistical difference in the performance of different learning techniques.
https://imgur.com/a/SCTpMsT
submitted by /u/kolenski1524
[visit reddit] [comments ..read more
Reddit » Statistics
7h ago
So in my research work I have eight horticulture crops across 4 locations as a factor. I am assessing their soil organic carbon at two depths. Under each location I've taken 3 farms each as my replication and data for soil organic carbon was collected at two depths. Now from what I've seen this data has to be analysed separately for each crop. But which statistical analysis do I need to follow if locations and depths are my two factors and there are 3 replications?
submitted by /u/AphroFelicity20
[visit reddit] [comments ..read more
Reddit » Statistics
11h ago
Can anyone help me go through this problem, i think we will use chi-sqaure test for this but im not sure, here is the problem: https://imgur.com/a/gISecbD
submitted by /u/kolenski1524
[visit reddit] [comments ..read more
Reddit » Statistics
11h ago
In my stats department there’s a faculty member who is a researcher in design of experiments. Mainly optimal design, but extending these ideas to modern data science applications (how to create designs for high dimensional data (super saturated designs)) and other DOE related work in applied data science settings.
I tried to find other faculty members in DOE, but aside from one at nc state and one at Virginia tech, I pretty much cannot find anyone who’s a researcher in design of experiments. Why are there not that many of these people in research? I can find a Bayesian at every department, bu ..read more
Reddit » Statistics
13h ago
Hello everyone, I have data that collected from same participants over 6 days and under 2 conditions each day(6x2 data points(columns) per subject). Distribution is not normal. Our aim is to check if there is a difference between these 2 conditions. So basically, I need to compare 2 conditions within each day and see if there is a difference. I thought to conduct wilcoxon signed rank test for each day, and then adjust p-values using holm-bonferrini method but would it be wrong?
submitted by /u/yagizdemir
[visit reddit] [comments ..read more
Reddit » Statistics
14h ago
I'm hoping that someone can help me with this, although it may seem a little morbid.
My father died of lung cancer on January 5th 2005.
My mother was murdered January 5th 2019.
Both died at the age of 56.
Does anyone know the likelihood/statistical probability of this happening? I've just always wondered.
Thank you for any help!
submitted by /u/TardisBlueSweetie
[visit reddit] [comments ..read more
Reddit » Statistics
20h ago
Part of my work is measuring steel structure, which requires us to pick up the edges of the structure.
As you can see in this image a pickup of some arbitrary "I" beams, i have manually picked out some points on those edges, though this isnt very robust.
Ideally i would like to select a group of cloud points, and have the x, y, or z edge picked out with some sort of statistical result.
I thought about taking the selection of points, asking the user if it was an x,y, or z edge, sorting by chosen direction, and ignoring the top 3% to have some sort of outlier filter. then take the average of th ..read more
Reddit » Statistics
23h ago
I am reading a book that introduces multivariate statistics, and In a chapter, they introduced PCA I already explained how it works but then they started with the question if we should do PCA with the covariance or correlation matrix, they say that when units do not matter we should use correlation as with this we can get the standardized units and the measure of the unit does not longer affects.
But then they say we should use a covariance matrix as this allows us to avoid making each variable equally important, so they never really concluded which should be a common approach.
Can someone pl ..read more