{"id":392,"date":"2013-04-09T11:43:00","date_gmt":"2013-04-09T15:43:00","guid":{"rendered":""},"modified":"2015-11-15T11:34:22","modified_gmt":"2015-11-15T16:34:22","slug":"statistics-2-0-dynamic-frames","status":"publish","type":"post","link":"https:\/\/jeremy-wu.info\/?p=392","title":{"rendered":"Statistics 2.0: Dynamic Frames"},"content":{"rendered":"<div align=\"center\" style=\"line-height: normal; margin-bottom: .0001pt; margin-bottom: 0in; text-align: center;\"><b style=\"mso-bidi-font-weight: normal;\"><span style=\"font-size: 14.0pt;\"> <\/span><\/b><\/div>\n<div align=\"center\" style=\"line-height: normal; margin-bottom: .0001pt; margin-bottom: 0in; text-align: center;\"><span style=\"font-size: 12.0pt;\"> <\/span><\/div>\n<div style=\"line-height: normal; margin-bottom: .0001pt; margin-bottom: 0in;\"><\/div>\n<div style=\"line-height: normal; margin-bottom: .0001pt; margin-bottom: 0in;\"><b style=\"mso-bidi-font-weight: normal;\"><span style=\"font-size: 14.0pt;\">Abstract<\/span><\/b><\/div>\n<div style=\"line-height: normal; margin-bottom: .0001pt; margin-bottom: 0in;\"><\/div>\n<div style=\"clear: both; text-align: center;\"><a href=\"http:\/\/4.bp.blogspot.com\/-XMurApKGCCs\/UWMmG04WjiI\/AAAAAAAAC0U\/uiTJLkzkuls\/s1600\/bd-271a_stadium_crowd_scene.jpg\" style=\"clear: right; float: right; margin-bottom: 1em; margin-left: 1em;\"><img loading=\"lazy\" decoding=\"async\" border=\"0\" height=\"121\" src=\"http:\/\/4.bp.blogspot.com\/-XMurApKGCCs\/UWMmG04WjiI\/AAAAAAAAC0U\/uiTJLkzkuls\/s1600\/bd-271a_stadium_crowd_scene.jpg\" width=\"320\" \/><\/a><\/div><p><\/p>\n<div style=\"line-height: normal; margin-bottom: .0001pt; margin-bottom: 0in;\"><span style=\"font-size: 12.0pt;\">A frame identifies all the known units in a population from which a census can be conducted or a random sample can be drawn, providing the structural foundation for the extraction of maximum, reliable information from designed statistical studies with the support of established statistical theories.<span style=\"mso-spacerun: yes;\">&nbsp; <\/span>The significance of the Big Data era is that most data are now digitized, easily stored, and processed in large quantity at relatively low cost.<\/span> <span style=\"mso-spacerun: yes;\">&nbsp;<\/span><span style=\"font-size: 12.0pt;\">Big Data offers unprecedented opportunities for statisticians to rethink and innovate.<\/span> <span style=\"mso-spacerun: yes;\">&nbsp;<\/span><span style=\"font-size: 12.0pt;\">Among the many possibilities offered by Big Data is the creation and maintenance of Dynamic Frames \u2013 frames that are rich in content, capture the most up-to-date data as soon as they become available, and produce results and reports in real time on demand.<\/span><\/div>\n<div style=\"line-height: normal; margin-bottom: .0001pt; margin-bottom: 0in;\"><\/div>\n<div style=\"line-height: normal; margin-bottom: .0001pt; margin-bottom: 0in;\"><b style=\"mso-bidi-font-weight: normal;\"><span style=\"font-size: 14.0pt;\">Traditional Population and Frame<\/span><\/b><span style=\"font-size: 12.0pt;\"><\/span><\/div>\n<div style=\"line-height: normal; margin-bottom: .0001pt; margin-bottom: 0in;\"><\/div>\n<div style=\"line-height: normal; margin-bottom: .0001pt; margin-bottom: 0in;\"><span style=\"font-size: 12.0pt;\">A population is an important concept in the study of statistics.<span style=\"mso-spacerun: yes;\">&nbsp; <\/span>It is commonly understood to be an entire collection of items of interest, be it a nation\u2019s people or businesses, a day&#8217;s production of light bulbs, or an ocean\u2019s fish [1,2,3].<\/span><\/div>\n<div style=\"line-height: normal; margin-bottom: .0001pt; margin-bottom: 0in;\"><\/div>\n<div style=\"line-height: normal; margin-bottom: .0001pt; margin-bottom: 0in;\"><span style=\"font-size: 12.0pt;\">A less well-known term is a frame, or a list of the units that cover the entire population with its identification system.<span style=\"mso-spacerun: yes;\">&nbsp;&nbsp; <\/span>A frame is the working definition of a population under study.<span style=\"mso-spacerun: yes;\">&nbsp; <\/span>It identifies all the known units in a population from which a census can be conducted or a random sample can be drawn, providing the structure for statistical description and analysis about the population [2,4,5].<\/span><\/div>\n<div style=\"line-height: normal; margin-bottom: .0001pt; margin-bottom: 0in;\"><span style=\"font-size: 12.0pt;\">&nbsp;<\/p>\n<table cellpadding=\"0\" cellspacing=\"0\" style=\"float: right; margin-left: 1em; text-align: right;\">\n<tbody>\n<tr>\n<td style=\"text-align: center;\"><a href=\"http:\/\/2.bp.blogspot.com\/-OX6hZRncN60\/UWMnDSe-kbI\/AAAAAAAAC0c\/pT_eymeTc4k\/s1600\/DynamicFramesFig1.png\" style=\"clear: right; margin-bottom: 1em; margin-left: auto; margin-right: auto;\"><img loading=\"lazy\" decoding=\"async\" alt=\"\" border=\"0\" height=\"190\" src=\"http:\/\/2.bp.blogspot.com\/-OX6hZRncN60\/UWMnDSe-kbI\/AAAAAAAAC0c\/pT_eymeTc4k\/s1600\/DynamicFramesFig1.png\" title=\"Figure 1\" width=\"400\" \/><\/a><\/td>\n<\/tr>\n<tr>\n<td style=\"text-align: center;\">Figure 1<\/td>\n<\/tr>\n<\/tbody>\n<\/table><p><\/span><\/div>\n<div style=\"line-height: normal; margin-bottom: .0001pt; margin-bottom: 0in;\"><span style=\"font-size: 12.0pt;\">Figure 1 shows a flow chart of a conventional statistical study by census or random sample.<span style=\"mso-spacerun: yes;\">&nbsp; <\/span>Quoting from [4], an ideal frame should have the following qualities:<\/span><\/div>\n<ul>\n<li><span style=\"font-family: Symbol; font-size: 12.0pt; mso-bidi-font-family: Symbol; mso-fareast-font-family: Symbol;\"><span style=\"mso-list: Ignore;\"><span style=\"font: 7.0pt &quot;Times New Roman&quot;;\"><\/span><\/span><\/span><span style=\"font-size: 12.0pt;\">All units have a logical, numerical identifier<\/span><\/li>\n<li><span style=\"font-size: 12.0pt;\">All units can be found \u2013 their contact information, map location or other relevant information is present<\/span><\/li>\n<li><span style=\"font-size: 12.0pt;\">The frame is organized in a logical, systematic fashion<\/span><\/li>\n<li><span style=\"font-size: 12.0pt;\">The frame has additional information about the units that allow the use of more advanced sampling frames<\/span><\/li>\n<li><span style=\"font-size: 12.0pt;\">Every element of the population of interest is present in the frame<\/span><\/li>\n<li><span style=\"font-size: 12.0pt;\">Every element of the population is present only once in the frame<\/span><\/li>\n<li><span style=\"font-size: 12.0pt;\">No elements from outside the population of interest are present in the frame<\/span><\/li>\n<li><span style=\"font-size: 12.0pt;\">The data is \u201cup-to-date\u201d<\/span><\/li>\n<\/ul>\n<div style=\"line-height: normal; margin-bottom: .0001pt; margin: 0in; mso-add-space: auto;\"><span style=\"font-size: 12.0pt;\">Modeling may be considered part of a sampling process, sometimes bypassing the need for a frame by <b style=\"mso-bidi-font-weight: normal;\"><i style=\"mso-bidi-font-style: normal;\">assuming<\/i><\/b> that the model and data adequately represent the underlying population.<span style=\"mso-spacerun: yes;\">&nbsp; <\/span><\/span><\/div>\n<div style=\"line-height: normal; margin-bottom: .0001pt; margin: 0in; mso-add-space: auto;\"><\/div>\n<div style=\"line-height: normal; margin-bottom: .0001pt; margin: 0in; mso-add-space: auto;\"><span style=\"font-size: 12.0pt;\">Practicing statisticians understand the importance of frames \u2013 it is the structural foundation for the extraction of maximum, reliable information from designed statistical studies with the support of established statistical theories.<span style=\"mso-spacerun: yes;\">&nbsp; <\/span>However, there are few statistical papers or forums that discuss the best practices for creating and maintaining a frame, primarily because it is viewed as an administrative or clerical task. <\/span><\/div>\n<div style=\"line-height: normal; margin-bottom: .0001pt; margin: 0in; mso-add-space: auto;\"><\/div>\n<div style=\"line-height: normal; margin-bottom: .0001pt; margin: 0in; mso-add-space: auto;\"><span style=\"font-size: 12.0pt;\">Many lament how difficult it is to obtain or maintain a good frame or their bitter experience of working with incomplete or error-prone frames.<span style=\"mso-spacerun: yes;\">&nbsp; <\/span>Indeed, poor quality frames may prevent a well-planned statistical study from even taking place or create misleading or biased results.<span style=\"mso-spacerun: yes;\">&nbsp; <\/span><\/span><\/div>\n<div style=\"line-height: normal; margin-bottom: .0001pt; margin: 0in; mso-add-space: auto;\"><\/div>\n<div style=\"line-height: normal; margin-bottom: .0001pt; margin: 0in; mso-add-space: auto;\"><span style=\"font-size: 12.0pt;\">Inadequate attention to the creation and maintenance of a flexible, up-to-date, and dynamic population frame has been costly to the statistics profession and the U.S. in terms of efficiency and innovation.<\/span><\/div>\n<div style=\"line-height: normal; margin-bottom: .0001pt; margin: 0in; mso-add-space: auto;\"><\/div>\n<div style=\"line-height: normal; margin-bottom: .0001pt; margin: 0in; mso-add-space: auto;\"><span style=\"font-size: 12.0pt;\">For example, according to [6], although \u201can accurate and complete address list is a critical ingredient in all U.S. Census Bureau surveys and censuses,\u201d each program prepared its own separate list until the concept of a national frame was advanced not even 20 years ago in the name of the Master Address File (MAF).<span style=\"mso-spacerun: yes;\">&nbsp; <\/span><\/span><\/div>\n<div style=\"line-height: normal; margin-bottom: .0001pt; margin: 0in; mso-add-space: auto;\"><\/div>\n<div style=\"line-height: normal; margin-bottom: .0001pt; margin: 0in; mso-add-space: auto;\"><span style=\"font-size: 12.0pt;\">The MAF is used primarily to support mail delivery of questionnaires [7], which is increasingly an outdated mode for information collection.<span style=\"mso-spacerun: yes;\">&nbsp; <\/span>It is relied upon heavily for follow-up visits to non-respondents, when rising labor costs are now met with tight budget constraints.<span style=\"mso-spacerun: yes;\">&nbsp; <\/span>Web-based questionnaire delivery or data submission was not allowed in the latest 2010 decennial census in the U.S.<span style=\"mso-spacerun: yes;\">&nbsp; <\/span>The MAF is also not designed to promote or support web-based applications.<span style=\"mso-spacerun: yes;\">&nbsp; <\/span><\/span><\/div>\n<div style=\"line-height: normal; margin-bottom: .0001pt; margin: 0in; mso-add-space: auto;\"><\/div>\n<div style=\"line-height: normal; margin-bottom: .0001pt; margin: 0in; mso-add-space: auto;\"><span style=\"font-size: 12.0pt;\">The arrival of the Big Data era seems to have caught the statistics profession in a deer-in-the-headlight moment.<span style=\"mso-spacerun: yes;\">&nbsp; <\/span>As statistician is hailed as \u201cthe sexiest job for the next 10 years\u201d and beyond [8], the profession is still wondering why statistics is undervalued and left out, while in search of a role it should play in the Big Data era [9].<\/span><\/div>\n<div style=\"line-height: normal; margin-bottom: .0001pt; margin: 0in; mso-add-space: auto;\"><\/div>\n<div style=\"line-height: normal; margin-bottom: .0001pt; margin: 0in; mso-add-space: auto;\"><span style=\"font-size: 12.0pt;\">Only a few seem to recognize that statistics is \u201cthe science of learning from data\u201d [10], regardless of how big or small the data are, and that the moment has arrived for the profession to join the revolution and remain relevant in the future.<\/span><\/div>\n<div style=\"line-height: normal; margin-bottom: .0001pt; margin: 0in; mso-add-space: auto;\"><\/div>\n<div style=\"line-height: normal; margin-bottom: .0001pt; margin: 0in; mso-add-space: auto;\"><b style=\"mso-bidi-font-weight: normal;\"><span style=\"font-size: 14.0pt;\">Statistics 2.0: Dynamic Frames<\/span><\/b><\/div>\n<div style=\"line-height: normal; margin-bottom: .0001pt; margin: 0in; mso-add-space: auto;\"><\/div>\n<div style=\"line-height: normal; margin-bottom: .0001pt; margin: 0in; mso-add-space: auto;\"><span style=\"font-size: 12.0pt;\">Big Data is a relative concept.<span style=\"mso-spacerun: yes;\">&nbsp; <\/span>Tomorrow\u2019s Big Data will be bigger than today\u2019s Big Data.<span style=\"mso-spacerun: yes;\">&nbsp; <\/span>If it is only the size of data that statisticians would consider, the impact of Big Data would be limited to only scaling the existing software and methods.<span style=\"mso-spacerun: yes;\">&nbsp; <\/span><\/span><\/div>\n<div style=\"line-height: normal; margin-bottom: .0001pt; margin: 0in; mso-add-space: auto;\"><\/div>\n<div style=\"line-height: normal; margin-bottom: .0001pt; margin: 0in; mso-add-space: auto;\"><span style=\"font-size: 12.0pt;\">The significance of the Big Data era is that most data are now digitized, including sound, vision, and handwriting [e.g., 11], much of which have never been available before.<span style=\"mso-spacerun: yes;\">&nbsp; <\/span>They can be easily stored and processed in large quantity at relatively low cost.<span style=\"mso-spacerun: yes;\">&nbsp; <\/span>Today\u2019s consumers of statistics are much higher in number and less interested in technical details, but they also want comprehensive, reliable, easy-to-use information rapidly and readily.<\/span><\/div>\n<div style=\"line-height: normal; margin-bottom: .0001pt; margin: 0in; mso-add-space: auto;\"><\/div>\n<div style=\"line-height: normal; margin-bottom: .0001pt; margin: 0in; mso-add-space: auto;\"><span style=\"font-size: 12.0pt;\">Big Data is as much a revolution in information technology as it is for advancement in statistics because it offers unprecedented opportunities for statisticians to rethink its systems and operations and innovate.<\/span><\/div>\n<div style=\"line-height: normal; margin-bottom: .0001pt; margin: 0in; mso-add-space: auto;\"><\/div>\n<div style=\"line-height: normal; margin-bottom: .0001pt; margin: 0in; mso-add-space: auto;\"><span style=\"font-size: 12.0pt;\">For example, mathematical statistics clearly demonstrates that a 5 percent random sample is superior to a 5 percent non-random sample.<span style=\"mso-spacerun: yes;\">&nbsp; <\/span>However, how does it compare to a 50 percent or a 95 percent non-random sample?<span style=\"mso-spacerun: yes;\">&nbsp; <\/span>We have continued to caution, warn, condemn, or dismiss large, non-random samples, but have done little to go beyond the existing framework of mathematical statistics. <span style=\"mso-spacerun: yes;\">&nbsp;<\/span>Is there not a point, albeit that it may vary from case to case, where the inherent statistical bias can be reduced by the large size of a non-random sample so that they can become practically acceptable and meaningful?<\/span><\/div>\n<div style=\"line-height: normal; margin-bottom: .0001pt; margin: 0in; mso-add-space: auto;\"><\/div>\n<div style=\"line-height: normal; margin-bottom: .0001pt; margin: 0in; mso-add-space: auto;\"><span style=\"font-size: 12.0pt;\">As another example, as long as Figure 1 remains the typical process of conducting statistical studies in a sequential and cross-sectional manner, there is little room for innovative improvement to reduce turnaround time or introduce new metrics such as measuring longitudinal change at the unit level [12].<span style=\"mso-spacerun: yes;\">&nbsp; <\/span>Is it absolutely impossible to produce accurate and reliable statistical results in real time?<span style=\"mso-spacerun: yes;\">&nbsp; <\/span>Or is it because we have become so comfortable with the present software, approach, and convenience that there is no desire to consider other possibilities? <\/span><\/div>\n<div style=\"line-height: normal; margin-bottom: .0001pt; margin: 0in; mso-add-space: auto;\"><\/div>\n<div style=\"line-height: normal; margin-bottom: .0001pt; margin: 0in; mso-add-space: auto;\"><span style=\"font-size: 12.0pt;\">Random sampling has been the dominant mode of statistical operation for a century [13].<span style=\"mso-spacerun: yes;\">&nbsp; <\/span>Because of Big Data, one may now study an entire population almost as easily as one can study a random sample today.<span style=\"mso-spacerun: yes;\">&nbsp; <\/span>Should we ignore this opportunity?<span style=\"mso-spacerun: yes;\">&nbsp; <\/span><\/span><\/div>\n<div style=\"line-height: normal; margin-bottom: .0001pt; margin: 0in; mso-add-space: auto;\"><\/div>\n<div style=\"line-height: normal; margin-bottom: .0001pt; margin: 0in; mso-add-space: auto;\"><span style=\"font-size: 12.0pt;\">If statisticians do not recognize or embrace the challenges of theory and practice posted by Big Data as part of the core of studying and practicing statistics, the risk is high that others including the yet-undefined \u201cdata scientists\u201d will fill the void [14]. <\/span><\/div>\n<div style=\"line-height: normal; margin-bottom: .0001pt; margin: 0in; mso-add-space: auto;\"><\/div>\n<div style=\"line-height: normal; margin-bottom: .0001pt; margin: 0in; mso-add-space: auto;\"><span style=\"font-size: 12.0pt;\">Among the many possibilities offered by Big Data is the creation and maintenance of Dynamic Frames \u2013 population frames that are rich in content, capture the most up-to-date data as soon as they become available, and produce results and reports according to established schedules or even in real time.<span style=\"mso-spacerun: yes;\">&nbsp; <\/span><\/span><\/div>\n<div style=\"line-height: normal; margin-bottom: .0001pt; margin: 0in; mso-add-space: auto;\"><\/div>\n<div style=\"line-height: normal; margin-bottom: .0001pt; margin: 0in; mso-add-space: auto;\"><span style=\"font-size: 12.0pt;\">With some user base exceeding one billion people in membership, E-Commerce companies and the social media are well positioned to apply their data from online transactions, emails, and blog postings to conduct market research and perform predictive analyses.<span style=\"mso-spacerun: yes;\">&nbsp; <\/span>A lay person may also capture these data in a less structured manner.<\/span><\/div>\n<div style=\"line-height: normal; margin-bottom: .0001pt; margin: 0in; mso-add-space: auto;\"><span style=\"font-size: 12.0pt;\">&nbsp;<\/p>\n<table cellpadding=\"0\" cellspacing=\"0\" style=\"float: right; margin-left: 1em; text-align: right;\">\n<tbody>\n<tr>\n<td style=\"text-align: center;\"><a href=\"http:\/\/4.bp.blogspot.com\/-TwPStqofYUo\/UWMngN76cpI\/AAAAAAAAC0k\/sx3Qz7NzWRg\/s1600\/DynamicFrames02+20130408.jpg\" style=\"clear: right; margin-bottom: 1em; margin-left: auto; margin-right: auto;\"><img loading=\"lazy\" decoding=\"async\" alt=\"\" border=\"0\" height=\"280\" src=\"http:\/\/4.bp.blogspot.com\/-TwPStqofYUo\/UWMngN76cpI\/AAAAAAAAC0k\/sx3Qz7NzWRg\/s1600\/DynamicFrames02+20130408.jpg\" title=\"Figure 2\" width=\"400\" \/><\/a><\/td>\n<\/tr>\n<tr>\n<td style=\"text-align: center;\">Figure 2<\/td>\n<\/tr>\n<\/tbody>\n<\/table><p><\/span><\/div>\n<div style=\"line-height: normal; margin-bottom: .0001pt; margin: 0in; mso-add-space: auto;\"><span style=\"font-size: 12.0pt;\">Figure 2 provides a simple schematic on how the Dynamic Frames may work, which are also described as longitudinal data systems in educational applications in the U.S. [15,16]<\/span><\/div>\n<div style=\"line-height: normal; margin-bottom: .0001pt; margin: 0in; mso-add-space: auto;\"><\/div>\n<div style=\"line-height: normal; margin-bottom: .0001pt; margin: 0in; mso-add-space: auto;\"><span style=\"font-size: 12.0pt;\">In essence, primary efforts are put into the creation and maintenance of the frame so that it is optimized by the previously identified properties.<span style=\"mso-spacerun: yes;\">&nbsp; <\/span>It is constantly updated with new data for every sampling unit over time.<span style=\"mso-spacerun: yes;\">&nbsp; <\/span><\/span><\/div>\n<div style=\"line-height: normal; margin-bottom: .0001pt; margin: 0in; mso-add-space: auto;\"><\/div>\n<div style=\"line-height: normal; margin-bottom: .0001pt; margin: 0in; mso-add-space: auto;\"><span style=\"font-size: 12.0pt;\">Statisticians must be fully engaged in the design, implementation, and operation of Dynamic Frames, in addition to the production of descriptive and analytical results.<span style=\"mso-spacerun: yes;\">&nbsp; <\/span>There are many new and traditional functions that statisticians can make major contributions.<span style=\"mso-spacerun: yes;\">&nbsp; <\/span><\/span><\/div>\n<div style=\"line-height: normal; margin-bottom: .0001pt; margin: 0in; mso-add-space: auto;\"><\/div>\n<div style=\"line-height: normal; margin-bottom: .0001pt; margin: 0in; mso-add-space: auto;\"><span style=\"font-size: 12.0pt;\">For example, the identification code is a key to unlocking the enormous power in Big Data.<span style=\"mso-spacerun: yes;\">&nbsp; <\/span>It controls the extent additional records and data may be linked, determines firsthand the overall quality of data and study, and is the first safeguard to protect confidentiality.<\/span><\/div>\n<div style=\"line-height: normal; margin-bottom: .0001pt; margin: 0in; mso-add-space: auto;\"><\/div>\n<div style=\"line-height: normal; margin-bottom: .0001pt; margin: 0in; mso-add-space: auto;\"><span style=\"font-size: 12.0pt;\">As another example, the size and content for the units have no conceivable limit.<span style=\"mso-spacerun: yes;\">&nbsp; <\/span>They depend only on availability of data, ability to link and match records, and design of system.<span style=\"mso-spacerun: yes;\">&nbsp; <\/span>Effective operation minimizes mismatches of records and collection of duplicative data that do not change or change in predictable manner.<span style=\"mso-spacerun: yes;\">&nbsp; <\/span><span style=\"mso-spacerun: yes;\">&nbsp;<\/span>Appropriate replacement or imputation for missing values ensures quality and timely integration of data.<\/span><\/div>\n<div style=\"line-height: normal; margin-bottom: .0001pt; margin: 0in; mso-add-space: auto;\"><\/div>\n<div style=\"line-height: normal; margin-bottom: .0001pt; margin: 0in; mso-add-space: auto;\"><span style=\"font-size: 12.0pt;\">Other enhancement of traditional statistical functions [14] include, but are not limited to, establishing continuous quality loops back to the data sources; developing new definitions, metrics, and standards for the dynamic frames; applying new statistical modeling for imputation, profiling, risk assessment, and creating artificial intelligence; developing innovative visualizations; improving statistical training and education; and protecting confidentiality.<\/span><\/div>\n<div style=\"line-height: normal; margin-bottom: .0001pt; margin: 0in; mso-add-space: auto;\"><\/div>\n<div style=\"line-height: normal; margin-bottom: .0001pt; margin: 0in; mso-add-space: auto;\"><b style=\"mso-bidi-font-weight: normal;\"><span style=\"font-size: 14.0pt;\">Summary<\/span><\/b><\/div>\n<div style=\"line-height: normal; margin-bottom: .0001pt; margin: 0in; mso-add-space: auto;\"><span style=\"font-size: 12.0pt;\"><span style=\"mso-spacerun: yes;\">&nbsp; <\/span><\/span><\/div>\n<div style=\"line-height: normal; margin-bottom: .0001pt; margin-bottom: 0in;\"><span style=\"font-size: 12.0pt;\">Dynamic frames will retain its original purpose as a list of known units for conducting censuses and drawing random samples as needed, but the potential use of structured Big Data is limited only by the imagination and innovative spirit of the statistics profession.<span style=\"mso-spacerun: yes;\">&nbsp; <\/span>Statisticians need to embrace Big Data as its own revolution, which will lead to the next level of human knowledge and practice by study and use of data. <\/span><\/div>\n<div style=\"line-height: normal; margin-bottom: .0001pt; margin-bottom: 0in;\"><\/p>\n<div style=\"line-height: normal; margin-bottom: 0.0001pt; text-align: left;\"><span style=\"font-size: 12.0pt;\">Co-authored by&nbsp;<\/span><\/div>\n<div style=\"line-height: normal; margin-bottom: 0.0001pt; text-align: left;\"><span style=\"font-size: 12.0pt;\">Jeremy S. Wu, Ph.D., <\/span><a href=\"mailto:Jeremy.s.wu@gmail.com\"><span style=\"font-size: 12.0pt;\">Jeremy.s.wu@gmail.com<\/span><\/a><\/div><p><span style=\"font-size: 12.0pt;\">Junchi Guo, Ph. D. Candidate, <\/span><a href=\"mailto:junchi@email.gwu.edu\"><span style=\"font-size: 12.0pt;\">junchi@email.gwu.edu<\/span><\/a><\/div>\n<div style=\"line-height: normal; margin-bottom: .0001pt; margin-bottom: 0in;\"><b style=\"mso-bidi-font-weight: normal;\"><span style=\"font-size: 14.0pt;\">References<\/span><\/b><\/div>\n<div style=\"line-height: normal; margin-bottom: .0001pt; margin-bottom: 0in;\"><\/div>\n<div style=\"line-height: normal; margin-bottom: .0001pt; margin-bottom: 0in;\"><span style=\"font-size: 12.0pt;\">[1] Hansen, Morris H.; Hurwitz, William N.; and Madow, William G.<span style=\"mso-spacerun: yes;\">&nbsp; <\/span>(1953).<span style=\"mso-spacerun: yes;\">&nbsp; <\/span><b style=\"mso-bidi-font-weight: normal;\"><i style=\"mso-bidi-font-style: normal;\">Sample Survey Methods and Theory.<\/i><\/b><span style=\"mso-spacerun: yes;\">&nbsp; <\/span>Wiley Classics Library Edition, John Wiley & Sons, Inc.<span style=\"mso-spacerun: yes;\">&nbsp; <\/span><\/span><\/div>\n<div style=\"line-height: normal; margin-bottom: .0001pt; margin-bottom: 0in;\"><\/div>\n<div style=\"line-height: normal; margin-bottom: .0001pt; margin-bottom: 0in;\"><span style=\"font-size: 12.0pt;\">[2] Kish, Leslie.<span style=\"mso-spacerun: yes;\">&nbsp; <\/span>(1965).<span style=\"mso-spacerun: yes;\">&nbsp; <\/span><b style=\"mso-bidi-font-weight: normal;\"><i style=\"mso-bidi-font-style: normal;\">Survey Sampling.<\/i><\/b><span style=\"mso-spacerun: yes;\">&nbsp; <\/span>Wiley Classics Library Edition, John Wiley & Sons, Inc.<span style=\"mso-spacerun: yes;\">&nbsp; <\/span><\/span><\/div>\n<div style=\"line-height: normal; margin-bottom: .0001pt; margin-bottom: 0in;\"><\/div>\n<div style=\"line-height: normal; margin-bottom: .0001pt; margin-bottom: 0in;\"><span style=\"font-size: 12.0pt;\">[3] Cochran, William G.<span style=\"mso-spacerun: yes;\">&nbsp; <\/span>(1977).<span style=\"mso-spacerun: yes;\">&nbsp; <\/span><b style=\"mso-bidi-font-weight: normal;\"><i style=\"mso-bidi-font-style: normal;\">Sampling Techniques.<\/i><\/b><span style=\"mso-spacerun: yes;\">&nbsp; <\/span>A Wiley Publication in Applied Statistics, Third Edition, John Wiley & Sons, Inc.<\/span><\/div>\n<div style=\"line-height: normal; margin-bottom: .0001pt; margin-bottom: 0in;\"><\/div>\n<div style=\"line-height: normal; margin-bottom: .0001pt; margin-bottom: 0in;\"><span style=\"font-size: 12.0pt;\">[4] Wikipedia.<span style=\"mso-spacerun: yes;\">&nbsp; <\/span><b style=\"mso-bidi-font-weight: normal;\"><i style=\"mso-bidi-font-style: normal;\">Sampling Frame.<\/i><\/b><span style=\"mso-spacerun: yes;\">&nbsp; <\/span>Available at <\/span><a href=\"http:\/\/en.wikipedia.org\/wiki\/Sampling_frame\"><span style=\"font-size: 12.0pt;\">http:\/\/en.wikipedia.org\/wiki\/Sampling_frame<\/span><\/a><span style=\"font-size: 12.0pt;\"> on April 8, 2013. <\/span><\/div>\n<div style=\"line-height: normal; margin-bottom: .0001pt; margin-bottom: 0in;\"><\/div>\n<div style=\"line-height: normal; margin-bottom: .0001pt; margin-bottom: 0in;\"><span style=\"font-size: 12.0pt;\">[5] Baidu.com.<span style=\"mso-spacerun: yes;\">&nbsp; <\/span><b style=\"mso-bidi-font-weight: normal;\"><i style=\"mso-bidi-font-style: normal;\">Sampling Frame <\/i><\/b><\/span><b style=\"mso-bidi-font-weight: normal;\"><i style=\"mso-bidi-font-style: normal;\"><span lang=\"ZH-CN\" style=\"font-family: \u5b8b\u4f53; font-size: 12.0pt; mso-ascii-font-family: Calibri; mso-ascii-theme-font: minor-latin; mso-fareast-font-family: \u5b8b\u4f53; mso-fareast-theme-font: minor-fareast; mso-hansi-font-family: Calibri; mso-hansi-theme-font: minor-latin;\">\u62bd\u6837\u6846<\/span><\/i><\/b><b style=\"mso-bidi-font-weight: normal;\"><i style=\"mso-bidi-font-style: normal;\"><span style=\"font-size: 12.0pt;\">.<\/span><\/i><\/b><span style=\"font-size: 12.0pt;\"><span style=\"mso-spacerun: yes;\">&nbsp; <\/span>Available at <\/span><a href=\"http:\/\/baike.baidu.com\/view\/1652958.htm\"><span style=\"font-size: 12.0pt;\">http:\/\/baike.baidu.com\/view\/1652958.htm<\/span><\/a><span style=\"font-size: 12.0pt;\"> on April 8, 2013.<\/span><\/div>\n<div style=\"line-height: normal; margin-bottom: .0001pt; margin-bottom: 0in;\"><\/div>\n<div style=\"line-height: normal; margin-bottom: .0001pt; margin-bottom: 0in;\"><span style=\"font-size: 12.0pt;\">[6] U.S. Census Bureau.<span style=\"mso-spacerun: yes;\">&nbsp; <\/span><b style=\"mso-bidi-font-weight: normal;\"><i style=\"mso-bidi-font-style: normal;\">Master Address File: Update Methodology and Quality Improvement Program,<\/i><\/b> by<\/span> <span style=\"font-size: 12.0pt;\">Philip M. Ghur, <span style=\"mso-spacerun: yes;\">&nbsp;<\/span>Machell Kindred, and Michael L. Mersch, 1994.<span style=\"mso-spacerun: yes;\">&nbsp; <\/span>Available at <\/span><a href=\"https:\/\/www.amstat.org\/sections\/srms\/Proceedings\/papers\/1994_128.pdf\"><span style=\"font-size: 12.0pt;\">https:\/\/www.amstat.org\/sections\/srms\/Proceedings\/papers\/1994_128.pdf<\/span><\/a><span style=\"font-size: 12.0pt;\"> on April 8, 2013.<\/span><\/div>\n<div style=\"line-height: normal; margin-bottom: .0001pt; margin-bottom: 0in;\"><\/div>\n<div style=\"line-height: normal; margin-bottom: .0001pt; margin-bottom: 0in;\"><span style=\"font-size: 12.0pt;\">[7] U.S. Census Bureau.<span style=\"mso-spacerun: yes;\">&nbsp; <\/span><b style=\"mso-bidi-font-weight: normal;\"><i style=\"mso-bidi-font-style: normal;\">The Master Address File for the 2010 Census<\/i><\/b>, by Joseph Salvo, April 7, 2006.<span style=\"mso-spacerun: yes;\">&nbsp; <\/span>Brookings Breakfast Briefings on the Census.<span style=\"mso-spacerun: yes;\">&nbsp; <\/span>Available at <\/span><a href=\"http:\/\/www.brookings.edu\/~\/media\/events\/2006\/4\/07community%20development\/20060407_salvo.pdf\"><span style=\"font-size: 12.0pt;\">http:\/\/www.brookings.edu\/~\/media\/events\/2006\/4\/07community%20development\/20060407_salvo.pdf<\/span><\/a><span style=\"font-size: 12.0pt;\"> on April 8, 2013.<\/span><\/div>\n<div style=\"line-height: normal; margin-bottom: .0001pt; margin-bottom: 0in;\"><\/div>\n<div style=\"line-height: normal; margin-bottom: .0001pt; margin-bottom: 0in;\"><span style=\"font-size: 12.0pt;\">[8] Varian, Hal.<span style=\"mso-spacerun: yes;\">&nbsp; <\/span><b style=\"mso-bidi-font-weight: normal;\"><i style=\"mso-bidi-font-style: normal;\">Hal Varian explains why statisticians will be the sexy job in the next 10 years<\/i><\/b>,<span style=\"mso-spacerun: yes;\">&nbsp; <\/span>September 15, 2009.<span style=\"mso-spacerun: yes;\">&nbsp; <\/span>YouTube.<span style=\"mso-spacerun: yes;\">&nbsp; <\/span>Available at <\/span><a href=\"http:\/\/www.youtube.com\/watch?v=pi472Mi3VLw\"><span style=\"font-size: 12.0pt;\">http:\/\/www.youtube.com\/watch?v=pi472Mi3VLw<\/span><\/a><span style=\"font-size: 12.0pt;\"> on April 8, 2013.<\/span><\/div>\n<div style=\"line-height: normal; margin-bottom: .0001pt; margin-bottom: 0in;\"><\/div>\n<div style=\"line-height: normal; margin-bottom: .0001pt; margin-bottom: 0in;\"><span style=\"font-size: 12.0pt;\">[9] Pierson, Steve and Wasserstein, Ron.<span style=\"mso-spacerun: yes;\">&nbsp; <\/span><b style=\"mso-bidi-font-weight: normal;\"><i style=\"mso-bidi-font-style: normal;\">Big Data and the Role of Statistics<\/i><\/b>, March 28, 2012.<span style=\"mso-spacerun: yes;\">&nbsp; <\/span>Available at <\/span><a href=\"http:\/\/community.amstat.org\/amstat\/blogs\/blogviewer?BlogKey=737fd276-0225-4c87-b7cb-0cfc7cd9e124\"><span style=\"font-size: 12.0pt;\">http:\/\/community.amstat.org\/amstat\/blogs\/blogviewer?BlogKey=737fd276-0225-4c87-b7cb-0cfc7cd9e124<\/span><\/a><span style=\"font-size: 12.0pt;\"> on April 8, 2013.<\/span><\/div>\n<div style=\"line-height: normal; margin-bottom: .0001pt; margin-bottom: 0in;\"><\/div>\n<div style=\"line-height: normal; margin-bottom: .0001pt; margin-bottom: 0in;\"><span style=\"font-size: 12.0pt;\">[10] van der Lann, Mark; Hsu, Jiann-Ping; and Rose, Sherri.<span style=\"mso-spacerun: yes;\">&nbsp; <\/span><b style=\"mso-bidi-font-weight: normal;\"><i style=\"mso-bidi-font-style: normal;\">Statistics Ready for a Revolution.<\/i><\/b><span style=\"mso-spacerun: yes;\">&nbsp; <\/span>Amstat News, September 1, 2010.<span style=\"mso-spacerun: yes;\">&nbsp; <\/span>Available at <\/span><a href=\"http:\/\/magazine.amstat.org\/blog\/2010\/09\/01\/statrevolution\/\"><span style=\"font-size: 12.0pt;\">http:\/\/magazine.amstat.org\/blog\/2010\/09\/01\/statrevolution\/<\/span><\/a><span style=\"font-size: 12.0pt;\"> on April 8, 2013.<\/span><\/div>\n<div style=\"line-height: normal; margin-bottom: .0001pt; margin-bottom: 0in;\"><\/div>\n<div style=\"line-height: normal; margin-bottom: .0001pt; margin-bottom: 0in;\"><span style=\"font-size: 12.0pt;\">[11] Washington Post.<span style=\"mso-spacerun: yes;\">&nbsp; <\/span><b style=\"mso-bidi-font-weight: normal;\"><i style=\"mso-bidi-font-style: normal;\">From the President\u2019s Hand to the Internet<\/i><\/b>.<span style=\"mso-spacerun: yes;\">&nbsp; <\/span>Available at <\/span><a href=\"http:\/\/www.washingtonpost.com\/lifestyle\/style\/from-the-presidents-hand-to-the-internet\/2013\/03\/21\/0b609e66-9282-11e2-9cfd-36d6c9b5d7ad_graphic.html\"><span style=\"font-size: 12.0pt;\">http:\/\/www.washingtonpost.com\/lifestyle\/style\/from-the-presidents-hand-to-the-internet\/2013\/03\/21\/0b609e66-9282-11e2-9cfd-36d6c9b5d7ad_graphic.html<\/span><\/a><span style=\"font-size: 12.0pt;\"> on April 8, 2013. <\/span><\/div>\n<div style=\"line-height: normal; margin-bottom: .0001pt; margin-bottom: 0in;\"><\/div>\n<div style=\"line-height: normal; margin-bottom: .0001pt; margin-bottom: 0in;\"><span style=\"font-size: 12.0pt;\">[12] Diggle, Peter J.; Heagerty, Patrick J.; Liang, Kung-Yee; and Zeger, Scott L. (2001).<span style=\"mso-spacerun: yes;\">&nbsp; <\/span><b style=\"mso-bidi-font-weight: normal;\"><i style=\"mso-bidi-font-style: normal;\">Analysis of Longitudinal Data.<\/i><\/b><span style=\"mso-spacerun: yes;\">&nbsp; <\/span>Second Edition, Oxford University Press.<\/span><\/div>\n<div style=\"line-height: normal; margin-bottom: .0001pt; margin-bottom: 0in;\"><\/div>\n<div style=\"line-height: normal; margin-bottom: .0001pt; margin-bottom: 0in;\"><span style=\"font-size: 12.0pt;\">[13] Wu, Jeremy S., Chinese translation by Zhang, Yaoting and Yu, Xiang.<span style=\"mso-spacerun: yes;\">&nbsp; <\/span><b style=\"mso-bidi-font-weight: normal;\"><i style=\"mso-bidi-font-style: normal;\">One Hundred Years of Sampling<\/i><\/b>, invited paper <b style=\"mso-bidi-font-weight: normal;\"><i style=\"mso-bidi-font-style: normal;\">in Sampling Theory and Practice<\/i><\/b>, ISBN7-5037-1670-3, 1995.<span style=\"mso-spacerun: yes;\">&nbsp; <\/span>China Statistical Publishing Company.<\/span><\/div>\n<div style=\"line-height: normal; margin-bottom: .0001pt; margin-bottom: 0in;\"><\/div>\n<div style=\"line-height: normal; margin-bottom: .0001pt; margin-bottom: 0in;\"><span style=\"font-size: 12.0pt;\">[14] Wu, Jeremy S. <b style=\"mso-bidi-font-weight: normal;\"><i style=\"mso-bidi-font-style: normal;\">21st Century Statistical Systems<\/i><\/b>, August 1, 2012.<span style=\"mso-spacerun: yes;\">&nbsp; <\/span>Available at <\/span><a href=\"https:\/\/jeremy-wu.info\/21st-century-statistical-systems\/\"><span style=\"font-size: 12.0pt;\">https:\/\/jeremy-wu.info\/21st-century-statistical-systems\/<\/span><\/a><span style=\"font-size: 12.0pt;\"> on April 8, 2013.<span style=\"mso-spacerun: yes;\">&nbsp; <\/span><\/span><\/div>\n<div style=\"line-height: normal; margin-bottom: .0001pt; margin-bottom: 0in;\"><\/div>\n<div style=\"line-height: normal; margin-bottom: .0001pt; margin-bottom: 0in;\"><span style=\"font-size: 12.0pt;\">[15] Data Quality Campaign.<span style=\"mso-spacerun: yes;\">&nbsp; <\/span><b style=\"mso-bidi-font-weight: normal;\"><i style=\"mso-bidi-font-style: normal;\">Using Data to Improve Student Achievement.<\/i><\/b><span style=\"mso-spacerun: yes;\">&nbsp; <\/span>Available at <\/span><a href=\"http:\/\/www.dataqualitycampaign.org\/\"><span style=\"font-size: 12.0pt;\">http:\/\/www.dataqualitycampaign.org\/<\/span><\/a><span style=\"font-size: 12.0pt;\"> on April 8, 2013.<\/span><\/div>\n<div style=\"line-height: normal; margin-bottom: .0001pt; margin-bottom: 0in;\"><\/div>\n<div style=\"line-height: normal; margin-bottom: .0001pt; margin-bottom: 0in;\"><span style=\"font-size: 12.0pt;\">[16] U.S. Department of Education.<span style=\"mso-spacerun: yes;\">&nbsp; <\/span><b style=\"mso-bidi-font-weight: normal;\"><i style=\"mso-bidi-font-style: normal;\">Statewide Longitudinal Data Systems Grant Program<\/i><\/b>, National Center for Education Statistics.<span style=\"mso-spacerun: yes;\">&nbsp; <\/span>Available at <\/span><a href=\"http:\/\/nces.ed.gov\/programs\/slds\/\"><span style=\"font-size: 12.0pt;\">http:\/\/nces.ed.gov\/programs\/slds\/<\/span><\/a><span style=\"font-size: 12.0pt;\"> on April 8, 2013.<\/span><\/div>\n<div style=\"line-height: normal; margin-bottom: .0001pt; margin-bottom: 0in;\"><\/div>\n","protected":false},"excerpt":{"rendered":"<p>Abstract A frame identifies all the known units in a population from which a census can be conducted or a random sample can be drawn, providing the structural foundation for [&hellip;]<\/p>\n","protected":false},"author":4,"featured_media":440,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[5,6],"tags":[39,41,25,40],"class_list":["post-392","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-big-data","category-statistics","tag-dynamic-frames","tag-frames","tag-innovation","tag-statisticians"],"_links":{"self":[{"href":"https:\/\/jeremy-wu.info\/index.php?rest_route=\/wp\/v2\/posts\/392","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/jeremy-wu.info\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/jeremy-wu.info\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/jeremy-wu.info\/index.php?rest_route=\/wp\/v2\/users\/4"}],"replies":[{"embeddable":true,"href":"https:\/\/jeremy-wu.info\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=392"}],"version-history":[{"count":1,"href":"https:\/\/jeremy-wu.info\/index.php?rest_route=\/wp\/v2\/posts\/392\/revisions"}],"predecessor-version":[{"id":445,"href":"https:\/\/jeremy-wu.info\/index.php?rest_route=\/wp\/v2\/posts\/392\/revisions\/445"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/jeremy-wu.info\/index.php?rest_route=\/wp\/v2\/media\/440"}],"wp:attachment":[{"href":"https:\/\/jeremy-wu.info\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=392"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/jeremy-wu.info\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=392"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/jeremy-wu.info\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=392"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}