- two additional datasets have been included:
jvs
(Job Vacancy Survey; a probability sample survey) andadmin
(Central Job Offers Database; a non-probability sample survey). The units and auxiliary variables have been aligned in a way that allows the data to be integrated using the methods implemented in this package. - a
nonprobsvycheck
function was added to check the balance in the totals of the variables based on the weighted weights between the non-probability and probability samples. - Important - the functions
controlSel
,controlOut
andcontrolInf
have been replaced by their counterpartscontrol_sel
,control_out
andcontrol_inf
.
- basic methods and functions related to variance estimation, weights and probability linking methods have been rewritten in a more optimal and readable way.
- annotation has been added that arguments such as
strata
,subset
andna_action
are not supported for the time being.
- bug Fix occurring when estimation was based on auxiliary variable, which led to compression of the data from the frame to the vector.
- bug Fix related to not passing
maxit
argument fromcontrolSel
function to internally usednleqslv
function - bug Fix related to storing
vector
inmodel_frame
when predictingy_hat
in mass imputationglm
model when X is based in one auxiliary variable only - fix provided converting it todata.frame
object.
- added information to
summary
about quality of estimation basing on difference between estimated and known total values of auxiliary variables - added estimation of exact standard error for k-nearest neighbor estimator.
- added breaking change to
controlOut
function by switching values forpredictive_match
argument. From now on, thepredictive_match = 1
means$\hat{y}-\hat{y}$ in predictive mean matching imputation andpredictive_match = 2
corresponds to$\hat{y}-y$ matching. - implemented
div
option when variable selection (more in documentation) for doubly robust estimation. - added more insights to
nonprob
output such as gradient, hessian and jacobian derived from IPW estimation formle
andgee
methods whenIPW
orDR
model executed. - added estimated inclusion probabilities and its derivatives for probability and non-probability samples to
nonprob
output whenIPW
orDR
model executed. - added
model_frame
matrix data from probability sample used for mass imputation tononprob
whenMI
orDR
model executed.
- added unit tests for variable selection models and mi estimation with vector of population totals available
- implemented population mean estimation using doubly robust, inverse probability weighting and mass imputation methods
- implemented inverse probability weighting models with Maximum Likelihood Estimation and Generalized Estimating Equations methods with
logit
,complementary log-log
andprobit
link functions. - implemented
generalized linear models
,nearest neighbours
andpredictive mean matching
methods for Mass Imputation - implemented bias correction estimators for doubly-robust approach
- implemented estimation methods when vector of population means/totals is available
- implemented variables selection with
SCAD
,LASSO
andMCP
penalization equations - implemented
analytic
andbootstrap
(with parallel computation -doSNOW
package) variance for described estimators - added control parameters for models
- added S3 methods for object of
nonprob
class such asnobs
for samples sizepop.size
for population size estimationresiduals
for residuals of the inverse probability weighting modelcooks.distance
for identifying influential observations that have a significant impact on the parameter estimateshatvalues
for measuring the leverage of individual observationslogLik
for computing the log-likelihood of the model,AIC
(Akaike Information Criterion) for evaluating the model based on the trade-off between goodness of fit and complexity, helping in model selectionBIC
(Bayesian Information Criterion) for a similar purpose as AIC but with a stronger penalty for model complexityconfint
for calculating confidence intervals around parameter estimatesvcov
for obtaining the variance-covariance matrix of the parameter estimatesdeviance
for assessing the goodness of fit of the model
- added unit tests for IPW estimators.
- added automated
R-cmd
check
- added documentation for
nonprob
function.