Fundamentals of Statistics: Concepts and Data Analysis

1. Statistics: Descriptive vs. Inferential

Statistics is the science of collecting, organizing, presenting, analyzing, and interpreting data to draw meaningful conclusions and support decision-making.

Comparison of Statistical Methods

BasisDescriptive StatisticsInferential Statistics
MeaningSummarizes and describes dataDraws conclusions about population
PurposeTo present data clearlyTo make predictions/decisions
Data usedUses complete data setUses sample data
TechniquesMean, median, mode, graphsProbability, hypothesis testing
ComplexitySimple methodsAdvanced methods
OutputTables, charts, summariesEstimates, predictions
Example useSignal data summaryPredict signal behavior

Example (Signal Processing): If we measure signal amplitudes from a sensor, calculating mean and variance is descriptive statistics. Using sample data to predict future signal noise or system performance is inferential statistics.


2. Raw Data: Classification and Summarization

Raw Data: This is the original, unorganized data collected directly from observations or measurements without any processing.

Why Classify and Summarize Data?

  • Simplifies large amounts of data
  • Makes data easy to understand
  • Helps in the comparison of different datasets
  • Identifies patterns and trends in signals
  • Helps in detecting errors or noise
  • Saves time in analysis
  • Supports better decision-making

Example: Voltage readings taken continuously can be grouped into intervals to study signal behavior easily.


3. Qualitative vs. Quantitative Data

Data Comparison Table

BasisQualitative DataQuantitative Data
NatureDescriptiveNumerical
MeasurementCannot be measuredCan be measured
FormCategoriesNumbers
AnalysisNon-mathematicalMathematical
Example typeModulation typeVoltage value
PrecisionLess preciseMore precise
RepresentationWords/labelsNumbers/values

Variable Classification

  • Type of modulation used: Qualitative
  • Signal amplitude (in volts): Quantitative
  • Status of switch (ON/OFF): Qualitative

4. Scales of Measurement

Data is categorized into four primary scales:

  1. Nominal Scale: Used for labeling or categories only; no inherent order (e.g., ON/OFF).
  2. Ordinal Scale: Data has a specific order, but no exact difference between values (e.g., rank).
  3. Interval Scale: Equal intervals between values, but no true zero point (e.g., temperature in °C).
  4. Ratio Scale: Equal intervals with a true zero point (e.g., voltage, frequency).

Classification Examples

  • Temperature in °C: Interval Scale
  • Number of packets transmitted: Ratio Scale
  • Rank of signal strength: Ordinal Scale

5. Statistical Classification

Statistical Classification is the process of grouping data into classes or categories based on similar characteristics to make analysis simple and meaningful.

Characteristics of Effective Classification

  • Simple and clear: Easy to interpret.
  • Mutually exclusive: No overlap between categories.
  • Exhaustive: Covers all data points.
  • Homogeneous: Similar items are grouped together.
  • Flexible: Can adjust to new data.
  • Purpose-oriented: Aligned with analysis goals.
  • Stable and consistent: Reliable over time.