Maximum unambiguous range maximum theoretical range
Interquartile range = range between the first and third quartile.
Cumulative frequency = sum of all frequencies for all values.
Variance = the average of the squared differences from the Mean.
Standard variation = the average of the squared differences from the Mean under a squared root (the same as Variance just under a square root to get rid of the squared unit).
The Range = The distance between two values of which we combine their frequencies to simplify longer datasets.
Quartiles = A division of
Read MoreSOC 222: Measuring the Social World Study Notes
SOC 222: Measuring the Social World
Key Concepts and Definitions
Population vs. Sample
- Population: The entire group you want to study. Example: All students at UTM.
- Sample: A subset of the population used to make conclusions. Example: 100 UTM students surveyed in the library.
- Population Parameter: The true value in the population. Example: The actual percentage of all UTM students who cheat.
- Sample Statistic: The estimate derived from the sample. Example: 15% of surveyed students admit to cheating.
- Sampling
Statistical Analysis: Regression and Probability Models
Regression Analysis and Predictive Modeling
Regression analysis is a statistical method used to model the relationship between variables and to predict the value of one variable using another.
Main Types of Regression
- Simple linear regression: One independent variable and one dependent variable.
- Multiple regression: Several independent variables predicting one dependent variable.
- Logistic regression: Used when the dependent variable represents probabilities or categories.
The goal of simple linear regression
Read MoreEssential Statistics: Sampling, Distributions, and Testing
1. Sampling and Basic Concepts
Population: The entire group being studied.
Sample: A subset of the population.
Example
- Population: All university students.
- Sample: 200 students surveyed.
Parameter vs. Statistic
- Parameter: A numerical value describing a population.
- Statistic: A numerical value derived from a sample.
Examples:
- p = True population proportion.
- p̂ (p-hat) = Sample proportion.
Sample Proportion Formula
p̂ = x / n
Where:
- x = Number of successes.
- n = Sample size.
Example: 48 support a policy out of 80.
Read MoreCIS 2500 Exam 1 Excel Cheat Sheet: Data and Statistics
CIS 2500 – Exam 1 Cheat Sheet (Excel Focused)
Chapter 1 – Data Basics
- Population: All items in the study.
- Sample: Subset of the population.
- Parameter: Numerical value describing a population.
- Statistic: Numerical value describing a sample.
- Cross-sectional: Many entities at one time.
- Time series: One entity across the same point in time.
- Nominal: Labels only (numeric or non-numeric).
- Ordinal: Ranked categories (numeric or non-numeric).
- Interval: Numeric, no true zero.
- Ratio: Numeric, true zero.
- Qualitative
Essential Programming and Data Science Q&A
Python Fundamentals
Q: What is the difference between if, elif, and else?
ifchecks an initial condition.elifchecks another condition if the previous one is false.elseruns if none of the preceding conditions are true.
Q: When do we use a for loop instead of while?
- We use
forwhen we know the number of iterations. whileis used when the condition controls the loop execution.
Q: What is an infinite loop?
A loop that never stops because its condition is always true.
Q: What is a function?
A reusable block
Read More