

library(pdfsearch)

With the path to a PDF stored in file and the search output stored in result, the location of each keyword match, including page number and line number, and the actual line of text are returned by default. The line_text and token_text columns are list columns, so their contents are inspected separately.

#> # A tibble: 6 x 5
#>   keyword     page_num line_num line_text token_text
#> 1 measurement        1        2
#> 2 measurement        1        4
#> 3 measurement        1       10
#> 4 measurement        1       12
#> 5 measurement        1       15
#> 6 measurement        1       17

head(result$line_text, n = 2)
#> [[1]]
#> [1] "Reiter, Maria DeYoreo* arXiv:1610.00147v1 Abstract Often in surveys, key items are subject to measurement errors. "
#>
#> [[2]]
#> [1] "In some settings, however, analysts have access to a data source on different individuals with high quality measurements of the error-prone survey items. "

In a second run of the example the same six matches are returned, but each line_text entry also carries the lines immediately before and after the matching line.

#> # A tibble: 6 x 5
#>   keyword     page_num line_num line_text token_text
#> 1 measurement        1        2
#> 2 measurement        1        4
#> 3 measurement        1       10
#> 4 measurement        1       12
#> 5 measurement        1       15
#> 6 measurement        1       17

head(result$line_text, n = 2)
#> [[1]]
#> [1] "Data Fusion for Correcting Measurement Errors Tracy Schifeling, Jerome P. "
#> [2] "Reiter, Maria DeYoreo* arXiv:1610.00147v1 Abstract Often in surveys, key items are subject to measurement errors. "
#> [3] "Given just the data, it can be difficult to determine the distribution of this error process, and hence to obtain accurate inferences that involve the error-prone variables. "
#>
#> [[2]]
#> [1] "Given just the data, it can be difficult to determine the distribution of this error process, and hence to obtain accurate inferences that involve the error-prone variables. "
#> [2] "In some settings, however, analysts have access to a data source on different individuals with high quality measurements of the error-prone survey items. "
#> [3] "We present a data fusion framework for leveraging this information to improve inferences in the error-prone survey. "

Another use of the convert_tokens function is to convert the result text to tokens. The contents of token_result:

#> "data" "fusion" "for" "correcting"
#> "measurement" "errors" "tracy" "schifeling"
#> "jerome" "p" "reiter" "maria"
#> "deyoreo" "arxiv" "1610.00147v1" ""
#> "1" "oct" "2016" "abstract"
#> "often" "in" "surveys" "key"
#> "items" "are" "subject" "to"
#> "measurement" "errors" "given" "just"
#> "the" "data" "it" "can"
#> "be" "difficult" "to" "determine"
#> "the" "distribution" "of" "this"
#> "error" "process" "and" "hence"
#> "to" "obtain" "accurate" "inferences"
#> "that" "involve" "the" "error"
#> "prone" "variables" "in" "some"
#> "settings" "however" "analysts" "have"
#> "access" "to" "a" "data"
#> "source" "on" "different" "in"
#> "dividuals" "with" "high" "quality"
#> "measurements" "of" "the" "error"
#> "prone" "survey" "items" "we"
#> "present" "a" "data" "fusion"
#> "framework" "for" "leveraging" "this"
#> "information" "to" "improve" "infer"
#> "ences" "in" "the" "error"
#> "prone" "survey" "the" "basic"
#> "idea" "is" "to" "posit"
#> "models" "about" "the" "rates"
#> "at" "which" "individuals" "make"
#> "errors" "coupled" "with" "models"
#> "for" "the" "values" "reported"
#> "when" "errors" "are" "made"
#> "this" "can" "avoid" "the"
#> "unrealistic" "assumption" "of" "conditional"
#> "independence" "typically" "used" "in"
#> "data" "fusion" "we" "apply"
#> "the" "approach" "on" "the"
#> "re" "ported" "values" "of"
#> "educational" "attainments" "in" "the"
#> "american" "community" "survey" "using"
#> "the" "national" "survey" "of"
#> "college" "graduates" "as" "the"
#> "high" "quality" "data" "source"
#> "in" "doing" "so" "we"
#> "account" "for" "the" "informative"
#> "sampling" "design" "used" "to"
#> "select" "the" "national" "survey"
#> "of" "college" "graduates" "we"
#> "also" "present" "a" "process"
#> "for" "assessing" "the" "sensitivity"
#> "of" "various" "analyses" "to"
#> "different" "choices" "for" "the"
#> "measurement" "error" "models" "supplemental"
#> "material" "is" "available" "online"
#> "key" "words" "fusion" "imputation"
#> "measurement" "error" "missing" "survey"
#> "this" "research" "was" "supported"
#> "by" "the" "national" "science"
#> "foundation" "under" "award" "ses"
#> "11" "31897" "the" "authors"
#> "wish" "to" "thank" "seth"
#> "sanders" "for" "his" "input"
#> "on" "informative" "prior" "specifications"
#> "and" "mauricio" "sadinle" "for"
#> "discussion" "that" "improved" "the"
#> "strategy" "for" "accounting" "for"
#> "the" "informative" "sample" "design"
#> "1"
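To make the shape of the search result more concrete, here is a minimal base R sketch of the same idea: scan pages of text for a keyword, record the page number, line number and line text of each match, and split matched text into lowercase tokens. It is not how pdfsearch is implemented. The helpers find_keyword_lines and tokenize, the context_lines parameter, and the three sample lines are all assumptions made for illustration, and the tokenizer is a simplification that treats every non-alphanumeric character as a separator.

```r
# Hypothetical helper (not part of pdfsearch): search a list of pages,
# each a character vector of lines, for one or more keywords.
find_keyword_lines <- function(pages, keywords, context_lines = 0) {
  keyword <- character(0)
  page_num <- integer(0)
  line_num <- integer(0)
  line_text <- character(0)
  for (p in seq_along(pages)) {
    lines <- pages[[p]]
    for (kw in keywords) {
      # Case-insensitive literal match; line numbers are within the page.
      hits <- grep(tolower(kw), tolower(lines), fixed = TRUE)
      for (i in hits) {
        # Optionally include neighbouring lines around the match.
        from <- max(1, i - context_lines)
        to <- min(length(lines), i + context_lines)
        keyword <- c(keyword, kw)
        page_num <- c(page_num, p)
        line_num <- c(line_num, i)
        line_text <- c(line_text, paste(lines[from:to], collapse = " "))
      }
    }
  }
  data.frame(keyword, page_num, line_num, line_text,
             stringsAsFactors = FALSE)
}

# Hypothetical helper: lowercase the text, drop punctuation, split on spaces.
tokenize <- function(text) {
  cleaned <- gsub("[^a-z0-9]+", " ", tolower(text))
  tokens <- strsplit(trimws(cleaned), " ", fixed = TRUE)[[1]]
  tokens[nzchar(tokens)]
}

# A single made-up page built from phrases in the example abstract.
pages <- list(c(
  "Data Fusion for Correcting Measurement Errors",
  "Often in surveys, key items are subject to measurement errors.",
  "We present a data fusion framework for leveraging this information."
))

hits <- find_keyword_lines(pages, "measurement")
print(hits)
print(tokenize(hits$line_text[2]))
```

Calling find_keyword_lines(pages, "measurement", context_lines = 1) returns the same two matches with the neighbouring lines pasted into line_text, which mirrors the difference between the two runs of the example.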
