Maximum unambiguous range maximum theoretical range

Interquartile range = range between the first and third quartile.

Cumulative frequency = sum of all frequencies for all values.

Variance = the average of the squared differences from the Mean.

Standard variation = the average of the squared differences from the Mean under a squared root (the same as Variance just under a square root to get rid of the squared unit).

The Range = The distance between two values of which we combine their frequencies to simplify longer datasets.

Quartiles = A division of

Read More

SOC 222: Measuring the Social World Study Notes

SOC 222: Measuring the Social World

Key Concepts and Definitions

Population vs. Sample

  • Population: The entire group you want to study. Example: All students at UTM.
  • Sample: A subset of the population used to make conclusions. Example: 100 UTM students surveyed in the library.
  • Population Parameter: The true value in the population. Example: The actual percentage of all UTM students who cheat.
  • Sample Statistic: The estimate derived from the sample. Example: 15% of surveyed students admit to cheating.
  • Sampling
Read More

Statistical Analysis: Regression and Probability Models

Regression Analysis and Predictive Modeling

Regression analysis is a statistical method used to model the relationship between variables and to predict the value of one variable using another.

Main Types of Regression

  • Simple linear regression: One independent variable and one dependent variable.
  • Multiple regression: Several independent variables predicting one dependent variable.
  • Logistic regression: Used when the dependent variable represents probabilities or categories.

The goal of simple linear regression

Read More

Essential Statistics: Sampling, Distributions, and Testing

1. Sampling and Basic Concepts

Population: The entire group being studied.
Sample: A subset of the population.

Example

  • Population: All university students.
  • Sample: 200 students surveyed.

Parameter vs. Statistic

  • Parameter: A numerical value describing a population.
  • Statistic: A numerical value derived from a sample.

Examples:

  • p = True population proportion.
  • (p-hat) = Sample proportion.

Sample Proportion Formula

p̂ = x / n

Where:

  • x = Number of successes.
  • n = Sample size.

Example: 48 support a policy out of 80.

Read More

CIS 2500 Exam 1 Excel Cheat Sheet: Data and Statistics

CIS 2500 – Exam 1 Cheat Sheet (Excel Focused)

Chapter 1 – Data Basics

  • Population: All items in the study.
  • Sample: Subset of the population.
  • Parameter: Numerical value describing a population.
  • Statistic: Numerical value describing a sample.
  • Cross-sectional: Many entities at one time.
  • Time series: One entity across the same point in time.
  • Nominal: Labels only (numeric or non-numeric).
  • Ordinal: Ranked categories (numeric or non-numeric).
  • Interval: Numeric, no true zero.
  • Ratio: Numeric, true zero.
  • Qualitative
Read More

Essential Programming and Data Science Q&A

Python Fundamentals

Q: What is the difference between if, elif, and else?

  • if checks an initial condition.
  • elif checks another condition if the previous one is false.
  • else runs if none of the preceding conditions are true.

Q: When do we use a for loop instead of while?

  • We use for when we know the number of iterations.
  • while is used when the condition controls the loop execution.

Q: What is an infinite loop?

A loop that never stops because its condition is always true.

Q: What is a function?

A reusable block

Read More