[Q] I have a question regarding normality of a variable
Reddit » Statistics
by /u/kolenski1524
2h ago
Can anyone help me go through this problem? I think we will use a chi-square test for this, but I'm not sure. Here is the problem: https://imgur.com/a/gISecbD
Why are there barely any design of experiments researchers in stats departments? [Q]
by /u/Direct-Touch469
2h ago
In my stats department there’s a faculty member who researches design of experiments: mainly optimal design, but also extending these ideas to modern data science applications (for example, how to create designs for high-dimensional data, such as supersaturated designs) and other DOE-related work in applied data science settings. I tried to find other faculty members in DOE, but aside from one at NC State and one at Virginia Tech, I pretty much cannot find anyone who’s a researcher in design of experiments. Why are there not that many of these people in research? I can find a Bayesian at every department, bu ...
[Q] Multiple Wilcoxon Signed Rank Tests?
by /u/yagizdemir
3h ago
Hello everyone, I have data collected from the same participants over 6 days and under 2 conditions each day (6 x 2 data points (columns) per subject). The distribution is not normal. Our aim is to check whether there is a difference between the 2 conditions, so I need to compare the 2 conditions within each day. I thought of conducting a Wilcoxon signed-rank test for each day and then adjusting the p-values using the Holm-Bonferroni method. Would that be wrong?
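The adjustment step of that plan can be sketched in plain Python. This is only a minimal illustration of Holm's step-down method: the function name `holm_adjust` and the six p-values are made-up placeholders, not results from the poster's data. In practice each per-day p-value could come from something like `scipy.stats.wilcoxon(cond_a, cond_b)` applied to that day's paired measurements.

```python
def holm_adjust(p_values):
    """Return Holm step-down adjusted p-values, in the original order."""
    m = len(p_values)
    # Indices of the p-values, sorted from smallest to largest.
    order = sorted(range(m), key=lambda i: p_values[i])
    adjusted = [0.0] * m
    running_max = 0.0
    for rank, idx in enumerate(order):
        # The k-th smallest p-value is multiplied by (m - k + 1), capped at 1.
        candidate = min(1.0, (m - rank) * p_values[idx])
        # Enforce monotonicity: adjusted values never decrease along the sorted order.
        running_max = max(running_max, candidate)
        adjusted[idx] = running_max
    return adjusted


# Placeholder p-values, one per day (illustrative only).
day_p_values = [0.01, 0.04, 0.03, 0.005, 0.20, 0.60]
adjusted_p = holm_adjust(day_p_values)
for day, (raw, adj) in enumerate(zip(day_p_values, adjusted_p), start=1):
    print(f"day {day}: raw p = {raw:.3f}, Holm-adjusted p = {adj:.3f}")
```

A day would then be flagged as showing a difference when its adjusted p-value falls below the chosen significance level.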
[Q] maybe a bit of a morbid question?
by /u/TardisBlueSweetie
5h ago
I'm hoping that someone can help me with this, although it may seem a little morbid. My father died of lung cancer on January 5th, 2005. My mother was murdered on January 5th, 2019. Both died at the age of 56. Does anyone know the likelihood/statistical probability of this happening? I've just always wondered. Thank you for any help!
[Q] What methods could you use to determine the correct edge point (or line) of a point cloud?
by /u/GusIsBored
11h ago
Part of my work is measuring steel structures, which requires us to pick up the edges of the structure. As you can see in this image, a pickup of some arbitrary "I" beams, I have manually picked out some points on those edges, though this isn't very robust. Ideally I would like to select a group of cloud points and have the x, y, or z edge picked out with some sort of statistical result. I thought about taking the selection of points, asking the user whether it is an x, y, or z edge, sorting by the chosen direction, and ignoring the top 3% as a sort of outlier filter, then taking the average of th ...
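A rough sketch of that idea in plain Python is shown here. The post is cut off before it says what gets averaged, so the `band_frac` parameter (average the next 5% of points below the trimmed ones) is purely an assumption for illustration, as are the function name `estimate_edge`, the choice of the upper side as the edge, and the toy points.

```python
def estimate_edge(points, axis, trim_frac=0.03, band_frac=0.05):
    """Estimate the upper edge coordinate of a point selection along one axis.

    points: sequence of (x, y, z) tuples; axis: 0, 1 or 2.
    trim_frac: share of the largest values discarded as outliers (3% in the post).
    band_frac: share of the remaining values averaged (assumed, not from the post).
    """
    values = sorted(p[axis] for p in points)
    n_trim = round(len(values) * trim_frac)
    # Drop the largest values, treating them as outliers.
    kept = values[: len(values) - n_trim] if n_trim else values
    # Average a thin band of the largest remaining values.
    n_band = max(1, round(len(kept) * band_frac))
    band = kept[-n_band:]
    return sum(band) / len(band)


# Toy selection: x runs from 0 to 96, plus three stray points far away.
toy_points = [(float(i), 0.0, 0.0) for i in range(97)]
toy_points += [(1000.0, 0.0, 0.0)] * 3
edge_x = estimate_edge(toy_points, axis=0)
print(f"estimated x edge: {edge_x:.2f}")
```

A fixed trim percentage only removes outliers if there are fewer of them than the trim allows, so a quantile or a robust line fit might be worth comparing against on real scans.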
[Q] Correlation or Covariance matrix on PCA
by /u/Unhappy_Passion9866
14h ago
I am reading a book that introduces multivariate statistics. In one chapter they introduce PCA and explain how it works, but then they raise the question of whether PCA should be done with the covariance or the correlation matrix. They say that when units do not matter we should use the correlation matrix, since it standardizes the variables so that the unit of measurement no longer has an effect. But then they say we should use the covariance matrix, as this avoids making each variable equally important. So they never really conclude which should be the common approach. Can someone pl ...
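The unit dependence described above can be seen in a small two-variable sketch. It uses the closed-form eigen-decomposition of a 2x2 symmetric matrix, so it needs only the standard library. The helper names and the length/weight numbers are made up for illustration and do not come from the book.

```python
import math

def cov_2d(x, y):
    """Sample variances and covariance of two variables: (sxx, sxy, syy)."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxx = sum((a - mx) ** 2 for a in x) / (n - 1)
    syy = sum((b - my) ** 2 for b in y) / (n - 1)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y)) / (n - 1)
    return sxx, sxy, syy

def corr_2d(x, y):
    """Correlation matrix entries in the same (sxx, sxy, syy) layout."""
    sxx, sxy, syy = cov_2d(x, y)
    return 1.0, sxy / math.sqrt(sxx * syy), 1.0

def first_pc(sxx, sxy, syy):
    """Unit-length loadings of the first principal component (assumes sxy != 0)."""
    trace, det = sxx + syy, sxx * syy - sxy ** 2
    lam1 = trace / 2 + math.sqrt(max(trace ** 2 / 4 - det, 0.0))
    v = (sxy, lam1 - sxx)  # eigenvector for the largest eigenvalue
    norm = math.hypot(*v)
    return v[0] / norm, v[1] / norm

# Illustrative data: the same lengths in metres and in millimetres.
length_m = [1.2, 1.5, 1.1, 1.8, 1.6]
length_mm = [v * 1000 for v in length_m]
weight_kg = [30, 42, 28, 55, 44]

cov_load_m = first_pc(*cov_2d(length_m, weight_kg))
cov_load_mm = first_pc(*cov_2d(length_mm, weight_kg))
corr_load_m = first_pc(*corr_2d(length_m, weight_kg))
corr_load_mm = first_pc(*corr_2d(length_mm, weight_kg))

print("covariance PCA, metres:     ", cov_load_m)
print("covariance PCA, millimetres:", cov_load_mm)
print("correlation PCA, metres:     ", corr_load_m)
print("correlation PCA, millimetres:", corr_load_mm)
```

With the covariance matrix, the first component follows whichever variable has the larger numeric variance, so changing metres to millimetres changes the answer. With the correlation matrix the loadings stay the same under either unit.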
[Q] What is the most efficient way to do a mediation if you can’t use software?
by /u/GiraffesDrinking
16h ago
I am doing research work. This is my first time doing research independently, and the data is very new to me. For reasons unknown, I can’t run a mediation test (see the link after the text): the Hayes software doesn’t work on my computer, and I have not been able to get someone out to look at it. I need to run a mediation analysis. Is there a website I can use? I have thousands of columns or rows. I’m sure you could do it by hand, but I think that would take all night. [http://afhayes.com/introduction-to-mediation-moderation-and-conditional-process-analysis.html]
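For a single mediator, the core calculation is small enough to script without dedicated software. The sketch below is not the Hayes software. It assumes a simple X -> M -> Y model with no covariates, estimates the paths by ordinary least squares, and adds a percentile bootstrap interval for the indirect effect. All function names are hypothetical and the data are simulated.

```python
import random

def mediation_effects(x, m, y):
    """OLS estimates for simple mediation: a (X->M), b (M->Y given X), direct, total."""
    n = len(x)
    mx, mm, my = sum(x) / n, sum(m) / n, sum(y) / n
    sxx = sum((a - mx) ** 2 for a in x)
    smm = sum((b - mm) ** 2 for b in m)
    sxm = sum((a - mx) * (b - mm) for a, b in zip(x, m))
    sxy = sum((a - mx) * (c - my) for a, c in zip(x, y))
    smy = sum((b - mm) * (c - my) for b, c in zip(m, y))
    det = sxx * smm - sxm ** 2
    a_path = sxm / sxx                        # slope of M on X
    b_path = (sxx * smy - sxm * sxy) / det    # slope of Y on M, controlling for X
    direct = (smm * sxy - sxm * smy) / det    # slope of Y on X, controlling for M
    total = sxy / sxx                         # slope of Y on X alone
    return {"a": a_path, "b": b_path, "indirect": a_path * b_path,
            "direct": direct, "total": total}

def bootstrap_indirect_ci(x, m, y, n_boot=500, alpha=0.05, seed=0):
    """Percentile bootstrap interval for the indirect effect a*b."""
    rng = random.Random(seed)
    n = len(x)
    estimates = []
    for _ in range(n_boot):
        idx = [rng.randrange(n) for _ in range(n)]  # resample cases with replacement
        est = mediation_effects([x[i] for i in idx],
                                [m[i] for i in idx],
                                [y[i] for i in idx])
        estimates.append(est["indirect"])
    estimates.sort()
    low = estimates[int(alpha / 2 * n_boot)]
    high = estimates[int((1 - alpha / 2) * n_boot) - 1]
    return low, high

# Simulated data for illustration only (true a = 0.5, b = 0.7, direct = 0.2).
sim = random.Random(42)
x = [sim.gauss(0, 1) for _ in range(200)]
m = [0.5 * xi + sim.gauss(0, 1) for xi in x]
y = [0.7 * mi + 0.2 * xi + sim.gauss(0, 1) for xi, mi in zip(x, m)]

effects = mediation_effects(x, m, y)
ci_low, ci_high = bootstrap_indirect_ci(x, m, y)
print({k: round(v, 3) for k, v in effects.items()})
print(f"bootstrap 95% interval for indirect effect: [{ci_low:.3f}, {ci_high:.3f}]")
```

In this linear setup the total effect equals the direct effect plus the indirect effect, which gives a quick sanity check on the output.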
[E] BSc in Data Science Engineering: What do you think?
by /u/Scbr24
17h ago
https://imgur.com/a/SjRB7SO I’m from Chile. Here there is a thing called a professional title: 1-2 years of elective courses and a graduating project after finishing the 4 years of a bachelor’s degree (fixed curriculum, few electives). The title is culturally accepted and taken for granted, and employers expect it. T1 university in the country has for some years now been pushing the college model of the US and Europe, with majors, minors and the option not to pursue a professional title. In 2021 they released a BSc in Data Science Engineering, which takes 4 years instead of the usual 5-6 years. I’m ...
[Q] How to assess the change in programs offered by several institutions within a state over time?
by /u/teacherofderp
22h ago
I have data for 30 colleges within a large state, spanning a 10-year period. Some colleges are primarily rural, while others are urban. In year 5, a policy is introduced that encourages each college to tailor its programs of study to its region’s local economy. If implemented as intended, each college would alter its offered programs of study in different ways based on the economic needs of the region it serves. I’d like to determine what influence the policy introduced in year 5 has had on its intended groups. I’m struggling to find a statistical way to structure the data to r ...
Datasets for Causal ML [D]
by /u/Direct-Touch469
22h ago
Does anyone know what datasets are out there for causal inference? I’d like to explore methods in the doubly robust ML literature, and I’d like to complement my learning by working on some datasets and learning the EconML software. Does anyone know of any datasets, specifically in the context of marketing/pricing/advertising, that would be good sources for applying causal inference techniques? I’m open to other datasets as well.
