PlanetMath
regression model (Definition)

In statistical modeling of $N$ data observations ($N<\infty$), two types of variables are usually defined. One is the response variable or variate, usually denoted by $Y$, and the other is the explanatory variable or covariate $X$. While there is only one response variable, there may be one or more explanatory variables. The response variable is considered random, whereas the explanatory variable(s) may or may not be random.

Based on the above setup, a univariate regression model, or simply regression model, is a statistical model with the following assumptions:

  1. all of the variables, random or not, are continuous in nature (as opposed to categorical in nature)
  2. the response variable $Y$ can be expressed as the sum of a function $f(\textbf{X})$, called the regression function, where $\textbf{X}$ represents the row vector of explanatory variables, and an error term $\varepsilon$: $$Y=f(\textbf{X})+\varepsilon=f(X_1,\ldots,X_p)+\varepsilon$$ where $p$ is the number of explanatory variables. $f(\textbf{X})$ is called the systematic component, and $\varepsilon$ is the random error component.
  3. the error component and the systematic component are independent
  4. random error variables $\varepsilon_i$ for the $N$ observations are iid normal with mean 0 and variance $\sigma^2$

Any unknown parameters appearing in the regression function $f$, other than the covariates, are called the regression coefficients.
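The four assumptions above can be illustrated by simulating data from a regression model. The following is a minimal sketch, not part of the original entry: the regression function $f(X_1,X_2)=\beta_0+\beta_1X_1+\beta_2X_2$, the coefficient values, the uniform distribution of the covariates, and the names `regression_function` and `simulate` are all assumptions chosen for illustration.

```python
import random


def regression_function(x, beta):
    """Hypothetical regression function f(x1, x2) = b0 + b1*x1 + b2*x2."""
    b0, b1, b2 = beta
    x1, x2 = x
    return b0 + b1 * x1 + b2 * x2


def simulate(n, beta, sigma, seed=0):
    """Draw n observations (x, y) with y = f(x) + eps, eps iid N(0, sigma^2)."""
    rng = random.Random(seed)
    data = []
    for _ in range(n):
        # Continuous covariates (assumed uniform on [0, 1] for this sketch).
        x = (rng.uniform(0.0, 1.0), rng.uniform(0.0, 1.0))
        # The error is drawn without reference to x, so the error component
        # and the systematic component are independent.
        eps = rng.gauss(0.0, sigma)
        data.append((x, regression_function(x, beta) + eps))
    return data


beta = (1.0, 2.0, -0.5)  # illustrative regression coefficients
sample = simulate(n=1000, beta=beta, sigma=0.1)

# With the true coefficients known, the residuals are exactly the errors,
# so their sample mean should be close to 0.
residuals = [y - regression_function(x, beta) for x, y in sample]
mean_residual = sum(residuals) / len(residuals)
```

In practice the coefficients are unknown and must be estimated from the observations; here they are fixed in advance only so that the error component can be inspected directly.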

Remarks

  • The conditional distribution of $Y$, given $\textbf{X}$, is normal, or Gaussian, with mean $\mu=\operatorname{E}\big[Y\mid\textbf{X}=\boldsymbol{x}\big]=\operatorname{E} \big[Y\mid X_1=x_1,\ldots,X_p=x_p\big]=f(\boldsymbol{x})$ and variance $\sigma^2$. In addition, the random variables $Y_i$ corresponding to the responses are independent.
  • Sometimes, Condition 4 above is dropped to encompass a wider class of regression models. Those models that satisfy Condition 4 are generally called normal, or Gaussian, regression models. Otherwise, they are called non-normal regression models. Some well known non-normal regression models are the logistic regression for binary data and the Poisson regression for count data.
  • A regression model can be classified by the number of explanatory variables. If there is only one explanatory variable, it is called a simple regression model. Otherwise, it is a multiple regression model.
  • A regression model can also be classified by the form of the regression function $f$. If $f$ is linear in the regression coefficients, that is, if $$f(\textbf{X})=\beta_0z_0(\textbf{X})+\cdots+\beta_kz_k(\textbf{X}),$$ where the functions $z_i(\textbf{X})$ do not contain any regression coefficients, then the model is called a linear regression model. Two examples of linear regression models are: $$Y=\beta_0+\beta_1X_1+\beta_2X_2+\beta_3X_1X_2+\varepsilon$$ and $$Y=\beta_0+\beta_1X+\cdots+\beta_kX^k+\varepsilon$$ The second one is called a polynomial regression model. Linear regression models belong to a more general class of statistical models called the general linear model, where explanatory variables are no longer restricted to be continuous. When $f$ cannot be expressed linearly in terms of the regression coefficients, the model is known as a non-linear regression model. An example of a non-linear regression model is $$Y=\beta_0+\frac{1}{\beta_1+\beta_2X}+\varepsilon$$
  • The univariate regression model can be generalized to what is known as the multivariate regression model, where at least two response variables are considered.
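The remark on linear regression models can be made concrete with the polynomial case, where $z_j(X)=X^j$. Because $f$ is linear in the coefficients, they can be estimated by a linear least squares fit, which reduces to solving a linear system (the normal equations). The sketch below is illustrative only: the entry itself does not prescribe an estimation method, and the function names `design_row`, `solve` and `fit_polynomial`, as well as the sample data, are assumptions.

```python
def design_row(x, k):
    """Basis functions z_j(x) = x**j, j = 0..k; they contain no coefficients."""
    return [x ** j for j in range(k + 1)]


def solve(a, b):
    """Solve the square system a * coef = b by Gaussian elimination
    with partial pivoting."""
    n = len(b)
    m = [row[:] + [rhs] for row, rhs in zip(a, b)]  # augmented matrix
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    coef = [0.0] * n
    for i in reversed(range(n)):  # back substitution
        s = sum(m[i][j] * coef[j] for j in range(i + 1, n))
        coef[i] = (m[i][n] - s) / m[i][i]
    return coef


def fit_polynomial(xs, ys, k):
    """Least squares estimate of beta in y = beta_0 + ... + beta_k x**k + eps,
    obtained from the normal equations (Z^T Z) beta = Z^T y."""
    z = [design_row(x, k) for x in xs]
    p = k + 1
    ztz = [[sum(row[i] * row[j] for row in z) for j in range(p)] for i in range(p)]
    zty = [sum(row[i] * y for row, y in zip(z, ys)) for i in range(p)]
    return solve(ztz, zty)


# Noise-free data from an assumed quadratic, so the fit should recover
# the coefficients up to rounding error.
xs = [i / 10 for i in range(-20, 21)]
true_beta = [1.0, -2.0, 0.5]
ys = [true_beta[0] + true_beta[1] * x + true_beta[2] * x ** 2 for x in xs]
beta_hat = fit_polynomial(xs, ys, k=2)
```

The same approach does not carry over to the non-linear example in the remarks, since there the coefficients $\beta_1$ and $\beta_2$ appear inside the denominator and no fixed set of basis functions $z_j$ exists; such models are usually fitted by iterative methods.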




"regression model" is owned by CWoo.

See Also: linear least squares fit

Other names:  univariate regression model
Also defines:  regression function, regression coefficient, simple regression model, multiple regression model, linear regression model, polynomial regression model, non-linear regression model

This is version 7 of regression model, born on 2004-07-29, modified 2006-09-24.
Object id is 6046, canonical name is RegressionModel.

Classification:
AMS MSC: 62J02 (Statistics :: Linear inference, regression :: General nonlinear regression)
 62J05 (Statistics :: Linear inference, regression :: Linear regression)
