
Stats and R
1,311 FOLLOWERS
A blog on statistics and R aiming at helping academics and professionals working with data to grasp important concepts in statistics and to apply them in R.
Stats and R
4M ago
I am happy to announce that our paper entitled “Right to be forgotten for mortgage insurance issued to cancer survivors: critical assessment and new proposal” has been accepted for publication in European Actuarial Journal. In this paper, we propose an alternative method to determine the waiting period opening the right to be forgotten in insurance. This new method is based on a constraint imposed to the premium, which is then transposed into a target on the conditional observed survival ..read more
Stats and R
8M ago
Motivation Coarse data Simulated example Real data example References Motivation A group of researchers from the Data Science Institute (DSI) at Hasselt University developed a new statistical model to estimate the incubation period of a pathogenic organism based on coarse data. The incubation period of an infectious disease (defined as the time elapsed between infection and the manifestation of first symptoms) is of great importance as it permits to shed light on the epidemic potential of a disease and to optimize the length of quarantine periods to freeze transmission ..read more
Stats and R
1y ago
Introduction Linear versus logistic regression Univariate versus multivariate logistic regression Data Binary logistic regression in R Univariate binary logistic regression Quantitative independent variable Qualitative independent variable Multivariate binary logistic regression Interaction Model selection Quality of a model Validity of the predictions Accuracy Sensitivity and specificity AUC and ROC curve Reporting results {gtsummary} package {finalfit} package Conditions of application Conclusion Introduction Regression is a common tool in statistics to test and quantify relationships betwee ..read more
Stats and R
1y ago
Introduction How likely is it? For our team For teams of different sizes Conclusion Introduction Last week, I joined a team to work on a collaborative project. The team was already established for a few months, with several scientists working together on the project. For simplicity, they used to sign documents, mention colleagues in emails, etc. with their initials (the first letter of their first name followed by the first letter of their last name ..read more
Stats and R
1y ago
Introduction Data {dplyr} package Filter observations The pipe operator Extract observations Based on their positions Based on their values Sample observations Sort observations Select variables Rename variables Create or modify variables Summarize observations Identify distinct values Connected operations Group by Number of observations Number of distinct values First, last or nth value If else Case when Conclusion and other resources Introduction In a previous post, we showed how to manipulate data in R ..read more
Stats and R
1y ago
I am happy to announce that our paper has been accepted for publication in Nutrients (ISSN 2072-6643) (Houben et al. 2023). This study investigates the impact of a food rebalancing program associated with plant-derived food supplements on the biometric, behavioral, and biological parameters of obese subjects. Read more here. Thanks to all co-authors for the great work, and the Nutrients Editorial Office for their guidance throughout this process. We are also thankful to the two anonymous reviewers for their input that has greatly helped shape the paper ..read more
Stats and R
1y ago
Introduction Data With ties Without ties Correlation coefficients by hand Pearson With and without ties Spearman With ties Without ties Kendall Without ties With ties Verification in R Conclusion Introduction In statistics, a correlation is used to evaluate the relationship between two variables. In a previous post, we showed how to compute a correlation and perform a correlation test in R ..read more
Stats and R
1y ago
Introduction Scraping Yahoo search engine results with R Conclusion Note: This is a guest post by Manthan Koolwal, founder of Scrapingdog. Introduction Web scraping is the process of extracting data from websites. It is usually done in an automated manner to obtain a large amounts of data through various websites, without the need to gather data by hand. In a previous post, we introduced this method and illustrated it with a Wikipedia page ..read more
Stats and R
1y ago
Introduction The two-way ANOVA (analysis of variance) is a statistical method that allows to evaluate the simultaneous effect of two categorical variables on a quantitative continuous variable. The two-way ANOVA is an extension of the one-way ANOVA since it allows to evaluate the effects on a numerical response of two categorical variables instead of one. The advantage of a two-way ANOVA over a one-way ANOVA is that we test the relationship between two variables, while taking into account the effect of a third variable ..read more
Stats and R
2y ago
I am happy to announce that a paper I contributed to has been accepted for publication in Open Forum Infectious Diseases (Truong et al. 2023). This study investigates bacterial meningitis among children aged under five years in Southern Vietnam for the last 10 years. Read more here. I hope this paper will, to some extent, be helpful for your research. As always, if you have any question related to the topic covered in this paper, please add it as a comment so other readers can benefit from the discussion ..read more