(all)
Global Humanities | History of Humanities | Liberal Arts | Humanities and Higher Education | Humanities as Research Activity | Humanities Teaching & Curricula | Humanities and the Sciences | Medical Humanities | Public Humanities | Humanities Advocacy | Humanities and Social Groups | Value of Humanities | Humanities and Economic Value | Humanities Funding | Humanities Statistics | Humanities Surveys | "Crisis" of the Humanities
Humanities Organizations: Humanities Councils (U.S.) | Government Agencies | Foundations | Scholarly Associations
Humanities in: Africa | Asia (East) | Asia (South) | Australasia | Europe | Latin America | Middle East | North America: Canada - Mexico - United States | Scandinavia | United Kingdom
(all)
Lists of News Sources | Databases with News Archives | History of Journalism | Journalism Studies | Journalism Statistics | Journalism Organizations | Student Journalism | Data Journalism | Media Frames (analyzing & changing media narratives using "frame theory") | Media Bias | Fake News | Journalism and Minorities | Journalism and Women | Press Freedom | News & Social Media
(all)
Corpus Representativeness
Comparison paradigms for idea of a corpus: Archives as Paradigm | Canons as Paradigm | Editions as Paradigm | Corpus Linguistics as Paradigm
(all)
Artificial Intelligence | Big Data | Data Mining | Data Notebooks (Jupyter Notebooks) | Data Visualization (see also Topic Model Visualizations) | Hierarchical Clustering | Interpretability & Explainability (see also Topic Model Interpretation) | Mapping | Natural Language Processing | Network Analysis | Open Science | Reporting & Documentation Methods | Reproducibility | Sentiment Analysis | Social Media Analysis | Statistical Methods | Text Analysis (see also Topic Modeling) | Text Classification | Wikification | Word Embedding & Vector Semantics
Topic Modeling (all)
Selected DH research and resources bearing on, or utilized by, the WE1S project.
(all)
Distant Reading | Cultural Analytics | | Sociocultural Approaches | Topic Modeling in DH | Non-consumptive Use
Searchable version of bibliography on Zotero site
For WE1S developers: Biblio style guide | Biblio collection form (suggest additions) | WE1S Bibliography Ontology Outline
2133649
Data Science
1
chicago-fullnote-bibliography
50
date
desc
year
1
1
1
2640
https://we1s.ucsb.edu/wp-content/plugins/zotpress/
%7B%22status%22%3A%22success%22%2C%22updateneeded%22%3Afalse%2C%22instance%22%3Afalse%2C%22meta%22%3A%7B%22request_last%22%3A0%2C%22request_next%22%3A0%2C%22used_cache%22%3Atrue%7D%2C%22data%22%3A%5B%7B%22key%22%3A%2249QSYPXT%22%2C%22library%22%3A%7B%22id%22%3A2133649%7D%2C%22meta%22%3A%7B%22creatorSummary%22%3A%22Smith%20and%20Cordes%22%2C%22parsedDate%22%3A%222020%22%2C%22numChildren%22%3A0%7D%2C%22bib%22%3A%22%3Cdiv%20class%3D%5C%22csl-bib-body%5C%22%20style%3D%5C%22line-height%3A%201.35%3B%20padding-left%3A%201em%3B%20text-indent%3A-1em%3B%5C%22%3E%5Cn%20%20%3Cdiv%20class%3D%5C%22csl-entry%5C%22%3ESmith%2C%20Gary%2C%20and%20Jay%20Cordes.%20%3Ci%3EThe%20Phantom%20Pattern%20Problem%3A%20The%20Mirage%20of%20Big%20Data%3C%5C%2Fi%3E.%20First%20edition.%20Oxford%26%23x202F%3B%3B%20New%20York%2C%20NY%3A%20Oxford%20University%20Press%2C%202020.%20%3Ca%20title%3D%27Cite%20in%20RIS%20Format%27%20class%3D%27zp-CiteRIS%27%20href%3D%27https%3A%5C%2F%5C%2Fwe1s.ucsb.edu%5C%2Fwp-content%5C%2Fplugins%5C%2Fzotpress%5C%2Flib%5C%2Frequest%5C%2Frequest.cite.php%3Fapi_user_id%3D2133649%26amp%3Bitem_key%3D49QSYPXT%27%3ECite%3C%5C%2Fa%3E%20%3C%5C%2Fdiv%3E%5Cn%3C%5C%2Fdiv%3E%22%2C%22data%22%3A%7B%22itemType%22%3A%22book%22%2C%22title%22%3A%22The%20phantom%20pattern%20problem%3A%20the%20mirage%20of%20big%20data%22%2C%22creators%22%3A%5B%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Gary%22%2C%22lastName%22%3A%22Smith%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Jay%22%2C%22lastName%22%3A%22Cordes%22%7D%5D%2C%22abstractNote%22%3A%22Pattern-recognition%20prowess%20served%20our%20ancestors%20well%2C%20but%20today%20we%20are%20confronted%20by%20a%20deluge%20of%20data%20that%20is%20far%20more%20abstract%2C%20complicated%2C%20and%20difficult%20to%20interpret.%20The%20number%20of%20possible%20patterns%20that%20can%20be%20identified%20relative%20to%20the%20number%20that%20are%20genuinely%20useful%20has%20grown%20exponentially%20-%20which%20means%20that%20the%20chances%20that%20a%20discovered%20pattern%20is%20useful%20is%20rapidly%20approaching%20zero.%5Cn%5CnPatterns%20in%20data%20are%20often%20used%20as%20evidence%2C%20but%20how%20can%20you%20tell%20if%20that%20evidence%20is%20worth%20believing%3F%20We%20are%20hard-wired%20to%20notice%20patterns%20and%20to%20think%20that%20the%20patterns%20we%20notice%20are%20meaningful.%20Streaks%2C%20clusters%2C%20and%20correlations%20are%20the%20norm%2C%20not%20the%20exception.%20Our%20challenge%20is%20to%20overcome%20our%20inherited%20inclination%20to%20think%20that%20all%20patterns%20are%20significant%2C%20as%20in%20this%20age%20of%20Big%20Data%20patterns%20are%20inevitable%20and%20usually%20coincidental.%5Cn%5CnThrough%20countless%20examples%2C%20The%20Phantom%20Pattern%20Problem%20is%20an%20engaging%20read%20that%20helps%20us%20avoid%20being%20duped%20by%20data%2C%20tricked%20into%20worthless%20investing%20strategies%2C%20or%20scared%20out%20of%20getting%20vaccinations.%22%2C%22date%22%3A%222020%22%2C%22language%22%3A%22en%22%2C%22ISBN%22%3A%22978-0-19-886416-5%22%2C%22url%22%3A%22%22%2C%22collections%22%3A%5B%5D%2C%22dateModified%22%3A%222021-07-01T06%3A42%3A08Z%22%2C%22tags%22%3A%5B%7B%22tag%22%3A%22Data%20mining%22%7D%2C%7B%22tag%22%3A%22Data%20science%22%7D%2C%7B%22tag%22%3A%22Interpretability%20and%20explainability%22%7D%2C%7B%22tag%22%3A%22Machine%20learning%22%7D%5D%7D%7D%2C%7B%22key%22%3A%22Y7Q3CUCI%22%2C%22library%22%3A%7B%22id%22%3A2133649%7D%2C%22meta%22%3A%7B%22creatorSummary%22%3A%22Koenzen%20et%20al.%22%2C%22parsedDate%22%3A%222020%22%2C%22numChildren%22%3A0%7D%2C%22bib%22%3A%22%3Cdiv%20class%3D%5C%22csl-bib-body%5C%22%20style%3D%5C%22line-height%3A%201.35%3B%20padding-left%3A%201em%3B%20text-indent%3A-1em%3B%5C%22%3E%5Cn%20%20%3Cdiv%20class%3D%5C%22csl-entry%5C%22%3EKoenzen%2C%20Andreas%2C%20Neil%20Ernst%2C%20and%20Margaret-Anne%20Storey.%20%26%23x201C%3BCode%20Duplication%20and%20Reuse%20in%20Jupyter%20Notebooks.%26%23x201D%3B%20%3Ci%3EArXiv%3A2005.13709%20%5BCs%5D%3C%5C%2Fi%3E%2C%202020.%20%3Ca%20class%3D%27zp-ItemURL%27%20href%3D%27http%3A%5C%2F%5C%2Farxiv.org%5C%2Fabs%5C%2F2005.13709%27%3Ehttp%3A%5C%2F%5C%2Farxiv.org%5C%2Fabs%5C%2F2005.13709%3C%5C%2Fa%3E.%20%3Ca%20title%3D%27Cite%20in%20RIS%20Format%27%20class%3D%27zp-CiteRIS%27%20href%3D%27https%3A%5C%2F%5C%2Fwe1s.ucsb.edu%5C%2Fwp-content%5C%2Fplugins%5C%2Fzotpress%5C%2Flib%5C%2Frequest%5C%2Frequest.cite.php%3Fapi_user_id%3D2133649%26amp%3Bitem_key%3DY7Q3CUCI%27%3ECite%3C%5C%2Fa%3E%20%3C%5C%2Fdiv%3E%5Cn%3C%5C%2Fdiv%3E%22%2C%22data%22%3A%7B%22itemType%22%3A%22journalArticle%22%2C%22title%22%3A%22Code%20Duplication%20and%20Reuse%20in%20Jupyter%20Notebooks%22%2C%22creators%22%3A%5B%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Andreas%22%2C%22lastName%22%3A%22Koenzen%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Neil%22%2C%22lastName%22%3A%22Ernst%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Margaret-Anne%22%2C%22lastName%22%3A%22Storey%22%7D%5D%2C%22abstractNote%22%3A%22Duplicating%20one%27s%20own%20code%20makes%20it%20faster%20to%20write%20software.%20This%20expediency%20is%20particularly%20valuable%20for%20users%20of%20computational%20notebooks.%20Duplication%20allows%20notebook%20users%20to%20quickly%20test%20hypotheses%20and%20iterate%20over%20data.%20In%20this%20paper%2C%20we%20explore%20how%20much%2C%20how%20and%20from%20where%20code%20duplication%20occurs%20in%20computational%20notebooks%2C%20and%20identify%20potential%20barriers%20to%20code%20reuse.%20Previous%20work%20in%20the%20area%20of%20computational%20notebooks%20describes%20developers%27%20motivations%20for%20reuse%20and%20duplication%20but%20does%20not%20show%20how%20much%20reuse%20occurs%20or%20which%20barriers%20they%20face%20when%20reusing%20code.%20To%20address%20this%20gap%2C%20we%20first%20analyzed%20GitHub%20repositories%20for%20code%20duplicates%20contained%20in%20a%20repository%27s%20Jupyter%20notebooks%2C%20and%20then%20conducted%20an%20observational%20user%20study%20of%20code%20reuse%2C%20where%20participants%20solved%20specific%20tasks%20using%20notebooks.%20Our%20findings%20reveal%20that%20repositories%20in%20our%20sample%20have%20a%20mean%20self-duplication%20rate%20of%207.6%25.%20However%2C%20in%20our%20user%20study%2C%20few%20participants%20duplicated%20their%20own%20code%2C%20preferring%20to%20reuse%20code%20from%20online%20sources.%22%2C%22date%22%3A%222020%22%2C%22language%22%3A%22en%22%2C%22DOI%22%3A%22%22%2C%22ISSN%22%3A%22%22%2C%22url%22%3A%22http%3A%5C%2F%5C%2Farxiv.org%5C%2Fabs%5C%2F2005.13709%22%2C%22collections%22%3A%5B%5D%2C%22dateModified%22%3A%222020-09-03T06%3A06%3A35Z%22%2C%22tags%22%3A%5B%7B%22tag%22%3A%22Data%20notebooks%22%7D%2C%7B%22tag%22%3A%22Data%20science%22%7D%2C%7B%22tag%22%3A%22Reproducibility%22%7D%5D%7D%7D%2C%7B%22key%22%3A%22LG4WZAIE%22%2C%22library%22%3A%7B%22id%22%3A2133649%7D%2C%22meta%22%3A%7B%22creatorSummary%22%3A%22Chattopadhyay%20et%20al.%22%2C%22parsedDate%22%3A%222020%22%2C%22numChildren%22%3A0%7D%2C%22bib%22%3A%22%3Cdiv%20class%3D%5C%22csl-bib-body%5C%22%20style%3D%5C%22line-height%3A%201.35%3B%20padding-left%3A%201em%3B%20text-indent%3A-1em%3B%5C%22%3E%5Cn%20%20%3Cdiv%20class%3D%5C%22csl-entry%5C%22%3EChattopadhyay%2C%20Souti%2C%20Ishita%20Prasad%2C%20Austin%20Z.%20Henley%2C%20Anita%20Sarma%2C%20and%20Titus%20Barik.%20%26%23x201C%3BWhat%26%23x2019%3Bs%20Wrong%20with%20Computational%20Notebooks%3F%20Pain%20Points%2C%20Needs%2C%20and%20Design%20Opportunities.%26%23x201D%3B%20In%20%3Ci%3EProceedings%20of%20the%202020%20CHI%20Conference%20on%20Human%20Factors%20in%20Computing%20Systems%3C%5C%2Fi%3E%2C%201%26%23x2013%3B12.%20CHI%20%26%23x2019%3B20.%20Honolulu%2C%20HI%2C%20USA%3A%20Association%20for%20Computing%20Machinery%2C%202020.%20%3Ca%20class%3D%27zp-ItemURL%27%20href%3D%27https%3A%5C%2F%5C%2Fdoi.org%5C%2F10.1145%5C%2F3313831.3376729%27%3Ehttps%3A%5C%2F%5C%2Fdoi.org%5C%2F10.1145%5C%2F3313831.3376729%3C%5C%2Fa%3E.%20%3Ca%20title%3D%27Cite%20in%20RIS%20Format%27%20class%3D%27zp-CiteRIS%27%20href%3D%27https%3A%5C%2F%5C%2Fwe1s.ucsb.edu%5C%2Fwp-content%5C%2Fplugins%5C%2Fzotpress%5C%2Flib%5C%2Frequest%5C%2Frequest.cite.php%3Fapi_user_id%3D2133649%26amp%3Bitem_key%3DLG4WZAIE%27%3ECite%3C%5C%2Fa%3E%20%3C%5C%2Fdiv%3E%5Cn%3C%5C%2Fdiv%3E%22%2C%22data%22%3A%7B%22itemType%22%3A%22conferencePaper%22%2C%22title%22%3A%22What%27s%20Wrong%20with%20Computational%20Notebooks%3F%20Pain%20Points%2C%20Needs%2C%20and%20Design%20Opportunities%22%2C%22creators%22%3A%5B%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Souti%22%2C%22lastName%22%3A%22Chattopadhyay%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Ishita%22%2C%22lastName%22%3A%22Prasad%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Austin%20Z.%22%2C%22lastName%22%3A%22Henley%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Anita%22%2C%22lastName%22%3A%22Sarma%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Titus%22%2C%22lastName%22%3A%22Barik%22%7D%5D%2C%22abstractNote%22%3A%22Computational%20notebooks%20-%20such%20as%20Azure%2C%20Databricks%2C%20and%20Jupyter%20-%20are%20a%20popular%2C%20interactive%20paradigm%20for%20data%20scientists%20to%20author%20code%2C%20analyze%20data%2C%20and%20interleave%20visualizations%2C%20all%20within%20a%20single%20document.%20Nevertheless%2C%20as%20data%20scientists%20incorporate%20more%20of%20their%20activities%20into%20notebooks%2C%20they%20encounter%20unexpected%20difficulties%2C%20or%20pain%20points%2C%20that%20impact%20their%20productivity%20and%20disrupt%20their%20workflow.%20Through%20a%20systematic%2C%20mixed-methods%20study%20using%20semi-structured%20interviews%20%28n%3D20%29%20and%20survey%20%28n%3D156%29%20with%20data%20scientists%2C%20we%20catalog%20nine%20pain%20points%20when%20working%20with%20notebooks.%20Our%20findings%20suggest%20that%20data%20scientists%20face%20numerous%20pain%20points%20throughout%20the%20entire%20workflow%20-%20from%20setting%20up%20notebooks%20to%20deploying%20to%20production%20-%20across%20many%20notebook%20environments.%20Our%20data%20scientists%20report%20essential%20notebook%20requirements%2C%20such%20as%20supporting%20data%20exploration%20and%20visualization.%20The%20results%20of%20our%20study%20inform%20and%20inspire%20the%20design%20of%20computational%20notebooks.%22%2C%22date%22%3A%222020%22%2C%22proceedingsTitle%22%3A%22Proceedings%20of%20the%202020%20CHI%20Conference%20on%20Human%20Factors%20in%20Computing%20Systems%22%2C%22conferenceName%22%3A%22%22%2C%22language%22%3A%22en%22%2C%22DOI%22%3A%2210.1145%5C%2F3313831.3376729%22%2C%22ISBN%22%3A%22978-1-4503-6708-0%22%2C%22url%22%3A%22https%3A%5C%2F%5C%2Fdoi.org%5C%2F10.1145%5C%2F3313831.3376729%22%2C%22collections%22%3A%5B%5D%2C%22dateModified%22%3A%222020-09-03T06%3A05%3A04Z%22%2C%22tags%22%3A%5B%7B%22tag%22%3A%22Data%20notebooks%22%7D%2C%7B%22tag%22%3A%22Data%20science%22%7D%5D%7D%7D%2C%7B%22key%22%3A%22ALXR57ZG%22%2C%22library%22%3A%7B%22id%22%3A2133649%7D%2C%22meta%22%3A%7B%22creatorSummary%22%3A%22DePratti%22%2C%22parsedDate%22%3A%222020%22%2C%22numChildren%22%3A0%7D%2C%22bib%22%3A%22%3Cdiv%20class%3D%5C%22csl-bib-body%5C%22%20style%3D%5C%22line-height%3A%201.35%3B%20padding-left%3A%201em%3B%20text-indent%3A-1em%3B%5C%22%3E%5Cn%20%20%3Cdiv%20class%3D%5C%22csl-entry%5C%22%3EDePratti%2C%20Roland.%20%26%23x201C%3BJupyter%20Notebooks%20versus%20a%20Textbook%20in%20a%20Big%20Data%20Course.%26%23x201D%3B%20%3Ci%3EJournal%20of%20Computing%20Sciences%20in%20Colleges%3C%5C%2Fi%3E%2035%2C%20no.%208%20%282020%29%3A%20208%26%23x2013%3B20.%20%3Ca%20class%3D%27zp-ItemURL%27%20href%3D%27https%3A%5C%2F%5C%2Fdl.acm.org%5C%2Fdoi%5C%2Fabs%5C%2F10.5555%5C%2F3417639.3417658%27%3Ehttps%3A%5C%2F%5C%2Fdl.acm.org%5C%2Fdoi%5C%2Fabs%5C%2F10.5555%5C%2F3417639.3417658%3C%5C%2Fa%3E.%20%3Ca%20title%3D%27Cite%20in%20RIS%20Format%27%20class%3D%27zp-CiteRIS%27%20href%3D%27https%3A%5C%2F%5C%2Fwe1s.ucsb.edu%5C%2Fwp-content%5C%2Fplugins%5C%2Fzotpress%5C%2Flib%5C%2Frequest%5C%2Frequest.cite.php%3Fapi_user_id%3D2133649%26amp%3Bitem_key%3DALXR57ZG%27%3ECite%3C%5C%2Fa%3E%20%3C%5C%2Fdiv%3E%5Cn%3C%5C%2Fdiv%3E%22%2C%22data%22%3A%7B%22itemType%22%3A%22journalArticle%22%2C%22title%22%3A%22Jupyter%20notebooks%20versus%20a%20textbook%20in%20a%20big%20data%20course%22%2C%22creators%22%3A%5B%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Roland%22%2C%22lastName%22%3A%22DePratti%22%7D%5D%2C%22abstractNote%22%3A%22In%20building%20curriculum%20in%20new%20areas%20of%20computer%20science%2C%20often%20the%20tools%20introduced%20in%20the%20course%20are%20an%20important%20component.%20This%20is%20especially%20true%20in%20the%20area%20of%20big%20data%2C%20where%20the%20complexity%20of%20the%20problems%20the%20area%20tackles%20is%20high.%20In%20the%204%20years%20since%20its%20inception%2C%20my%20big%20data%20course%20has%20gone%20through%20two%20major%20redesigns%20and%20has%20settled%20on%20a%20tool%20set%20including%3A%20the%20Hadoop%20platform%2C%20Spark%20processing%20engine%2C%20the%20Python%20programming%20language%2C%20Eclipse%20IDE%2C%20and%20Jupyter%20Notebooks.%20Many%20of%20the%20changes%20were%20driven%20by%20input%20from%20professional%20peers%20on%20big%20data%20teams%2C%20who%20were%20struggling%20with%20the%20complexity%20resulting%20from%20the%20low-level%20programming%20model%20used%20by%20MapReduce.%20Jupyter%20Notebook%2C%20a%20type%20of%20computational%20notebook%2C%20was%20added%20to%20the%20course%20to%20introduce%20students%20to%20the%20Python%20programming%20language.%20Data%20scientists%20and%20researchers%20have%20found%20computational%20notebooks%20an%20effective%20tool%20to%20manage%20their%20work%20by%20providing%20a%20way%20to%20track%20their%20thinking%20process%2C%20their%20code%2C%20and%20conclusions%20in%20one%20web%20document.%20To%20assess%20the%20effectiveness%20of%20using%20Jupyter%20Notebook%20in%20a%20big%20data%20course%2C%20students%27%20views%20on%20the%20use%20of%20computational%20notebooks%20and%20traditional%20textbooks%20were%20captured%20and%20statistically%20analyzed.%22%2C%22date%22%3A%222020%22%2C%22language%22%3A%22en%22%2C%22DOI%22%3A%22%22%2C%22ISSN%22%3A%22%22%2C%22url%22%3A%22https%3A%5C%2F%5C%2Fdl.acm.org%5C%2Fdoi%5C%2Fabs%5C%2F10.5555%5C%2F3417639.3417658%22%2C%22collections%22%3A%5B%5D%2C%22dateModified%22%3A%222020-09-03T05%3A59%3A32Z%22%2C%22tags%22%3A%5B%7B%22tag%22%3A%22Data%20notebooks%22%7D%2C%7B%22tag%22%3A%22Data%20science%22%7D%5D%7D%7D%2C%7B%22key%22%3A%22PSUNF3M9%22%2C%22library%22%3A%7B%22id%22%3A2133649%7D%2C%22meta%22%3A%7B%22creatorSummary%22%3A%22Willis%20et%20al.%22%2C%22parsedDate%22%3A%222020%22%2C%22numChildren%22%3A0%7D%2C%22bib%22%3A%22%3Cdiv%20class%3D%5C%22csl-bib-body%5C%22%20style%3D%5C%22line-height%3A%201.35%3B%20padding-left%3A%201em%3B%20text-indent%3A-1em%3B%5C%22%3E%5Cn%20%20%3Cdiv%20class%3D%5C%22csl-entry%5C%22%3EWillis%2C%20Alistair%2C%20Patricia%20Charlton%2C%20and%20Tony%20Hirst.%20%26%23x201C%3BDeveloping%20Students%26%23x2019%3B%20Written%20Communication%20Skills%20with%20Jupyter%20Notebooks.%26%23x201D%3B%20In%20%3Ci%3EProceedings%20of%20the%2051st%20ACM%20Technical%20Symposium%20on%20Computer%20Science%20Education%3C%5C%2Fi%3E%2C%201089%26%23x2013%3B95.%20SIGCSE%20%26%23x2019%3B20.%20Portland%2C%20OR%2C%20USA%3A%20Association%20for%20Computing%20Machinery%2C%202020.%20%3Ca%20class%3D%27zp-ItemURL%27%20href%3D%27https%3A%5C%2F%5C%2Fdoi.org%5C%2F10.1145%5C%2F3328778.3366927%27%3Ehttps%3A%5C%2F%5C%2Fdoi.org%5C%2F10.1145%5C%2F3328778.3366927%3C%5C%2Fa%3E.%20%3Ca%20title%3D%27Cite%20in%20RIS%20Format%27%20class%3D%27zp-CiteRIS%27%20href%3D%27https%3A%5C%2F%5C%2Fwe1s.ucsb.edu%5C%2Fwp-content%5C%2Fplugins%5C%2Fzotpress%5C%2Flib%5C%2Frequest%5C%2Frequest.cite.php%3Fapi_user_id%3D2133649%26amp%3Bitem_key%3DPSUNF3M9%27%3ECite%3C%5C%2Fa%3E%20%3C%5C%2Fdiv%3E%5Cn%3C%5C%2Fdiv%3E%22%2C%22data%22%3A%7B%22itemType%22%3A%22conferencePaper%22%2C%22title%22%3A%22Developing%20Students%27%20Written%20Communication%20Skills%20with%20Jupyter%20Notebooks%22%2C%22creators%22%3A%5B%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Alistair%22%2C%22lastName%22%3A%22Willis%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Patricia%22%2C%22lastName%22%3A%22Charlton%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Tony%22%2C%22lastName%22%3A%22Hirst%22%7D%5D%2C%22abstractNote%22%3A%22Written%20communication%20skills%20are%20considered%20to%20be%20highly%20desirable%20in%20computing%20graduates.%20However%2C%20many%20computing%20students%20do%20not%20have%20a%20background%20in%20which%20these%20skills%20have%20been%20developed%2C%20and%20the%20skills%20are%20often%20not%20well%20addressed%20within%20a%20computing%20curriculum.%20For%20some%20multidisciplinary%20areas%2C%20such%20as%20data%20science%2C%20the%20range%20of%20potential%20stakeholders%20makes%20the%20need%20for%20communications%20skills%20all%20the%20greater.%20As%20interest%20in%20data%20science%20increases%20and%20the%20technical%20skills%20of%20the%20area%20are%20in%20ever%20higher%20demand%2C%20understanding%20effective%20teaching%20and%20learning%20of%20these%20interdisciplinary%20aspects%20is%20receiving%20significant%20attention%20by%20academics%2C%20industry%20and%20government%20in%20an%20effort%20to%20address%20the%20digital%20skills%20gap.%20In%20this%20paper%2C%20we%20report%20on%20the%20experience%20of%20adapting%20a%20final%20year%20data%20science%20module%20in%20an%20undergraduate%20computing%20curriculum%20to%20help%20develop%20the%20skills%20needed%20for%20writing%20extended%20reports.%20From%20its%20inception%2C%20the%20module%20has%20used%20Jupyter%20notebooks%20to%20develop%20the%20students%27%20skills%20in%20the%20coding%20aspects%20of%20the%20module.%20However%2C%20over%20several%20presentations%2C%20we%20have%20investigated%20how%20the%20cell-based%20structure%20of%20the%20notebooks%20can%20be%20exploited%20to%20improve%20the%20students%27%20understanding%20of%20how%20to%20structure%20a%20report%20on%20a%20data%20investigation.%20We%20have%20increasingly%20designed%20the%20assessment%20for%20the%20module%20to%20take%20advantage%20of%20the%20learning%20affordances%20of%20Jupyter%20notebooks%20to%20support%20both%20raw%20data%20analysis%20and%20effective%20report%20writing.%20We%20reflect%20on%20the%20lessons%20learned%20from%20these%20changes%20to%20the%20assessment%20model%2C%20and%20the%20students%27%20responses%20to%20the%20changes.%22%2C%22date%22%3A%222020%22%2C%22proceedingsTitle%22%3A%22Proceedings%20of%20the%2051st%20ACM%20Technical%20Symposium%20on%20Computer%20Science%20Education%22%2C%22conferenceName%22%3A%22%22%2C%22language%22%3A%22en%22%2C%22DOI%22%3A%2210.1145%5C%2F3328778.3366927%22%2C%22ISBN%22%3A%22978-1-4503-6793-6%22%2C%22url%22%3A%22https%3A%5C%2F%5C%2Fdoi.org%5C%2F10.1145%5C%2F3328778.3366927%22%2C%22collections%22%3A%5B%5D%2C%22dateModified%22%3A%222020-09-03T05%3A54%3A43Z%22%2C%22tags%22%3A%5B%7B%22tag%22%3A%22Data%20notebooks%22%7D%2C%7B%22tag%22%3A%22Data%20science%22%7D%5D%7D%7D%2C%7B%22key%22%3A%222U3BSY5G%22%2C%22library%22%3A%7B%22id%22%3A2133649%7D%2C%22meta%22%3A%7B%22creatorSummary%22%3A%22Thylstrup%22%2C%22parsedDate%22%3A%222020%22%2C%22numChildren%22%3A0%7D%2C%22bib%22%3A%22%3Cdiv%20class%3D%5C%22csl-bib-body%5C%22%20style%3D%5C%22line-height%3A%201.35%3B%20padding-left%3A%201em%3B%20text-indent%3A-1em%3B%5C%22%3E%5Cn%20%20%3Cdiv%20class%3D%5C%22csl-entry%5C%22%3EThylstrup%2C%20Nanna%20Bonde%2C%20ed.%20%3Ci%3EUncertain%20Archives%3A%20Critical%20Keywords%20for%20Big%20Data%3C%5C%2Fi%3E.%20Cambridge%2C%20Massachusetts%3A%20The%20MIT%20Press%2C%202020.%20%3Ca%20title%3D%27Cite%20in%20RIS%20Format%27%20class%3D%27zp-CiteRIS%27%20href%3D%27https%3A%5C%2F%5C%2Fwe1s.ucsb.edu%5C%2Fwp-content%5C%2Fplugins%5C%2Fzotpress%5C%2Flib%5C%2Frequest%5C%2Frequest.cite.php%3Fapi_user_id%3D2133649%26amp%3Bitem_key%3D2U3BSY5G%27%3ECite%3C%5C%2Fa%3E%20%3C%5C%2Fdiv%3E%5Cn%3C%5C%2Fdiv%3E%22%2C%22data%22%3A%7B%22itemType%22%3A%22book%22%2C%22title%22%3A%22Uncertain%20archives%3A%20critical%20keywords%20for%20big%20data%22%2C%22creators%22%3A%5B%7B%22creatorType%22%3A%22editor%22%2C%22firstName%22%3A%22Nanna%20Bonde%22%2C%22lastName%22%3A%22Thylstrup%22%7D%5D%2C%22abstractNote%22%3A%22Scholars%20from%20a%20range%20of%20disciplines%20interrogate%20terms%20relevant%20to%20critical%20studies%20of%20big%20data%2C%20from%20abuse%20and%20aggregate%20to%20visualization%20and%20vulnerability.%5Cn%5CnThis%20groundbreaking%20work%20offers%20an%20interdisciplinary%20perspective%20on%20big%20data%20and%20the%20archives%20they%20accrue%2C%20interrogating%20key%20terms.%20Scholars%20from%20a%20range%20of%20disciplines%20analyze%20concepts%20relevant%20to%20critical%20studies%20of%20big%20data%2C%20arranged%20glossary%20style%5Cu2014from%20abuse%20and%20aggregate%20to%20visualization%20and%20vulnerability.%20They%20not%20only%20challenge%20conventional%20usage%20of%20such%20familiar%20terms%20as%20prediction%20and%20objectivity%20but%20also%20introduce%20such%20unfamiliar%20ones%20as%20overfitting%20and%20copynorm.%20The%20contributors%20include%20a%20broad%20range%20of%20leading%20and%20agenda-setting%20scholars%2C%20including%20as%20N.%20Katherine%20Hayles%2C%20Wendy%20Hui%20Kyong%20Chun%2C%20Johanna%20Drucker%2C%20Lisa%20Gitelman%2C%20Safiya%20Noble%2C%20Sarah%20T.%20Roberts%20and%20Nicole%20Starosielski.%5Cn%5CnUncertainty%20is%20inherent%20to%20archival%20practices%3B%20the%20archive%20as%20a%20site%20of%20knowledge%20is%20fraught%20with%20unknowns%2C%20errors%2C%20and%20vulnerabilities%20that%20are%20present%2C%20and%20perhaps%20even%20amplified%2C%20in%20big%20data%20regimes.%20Bringing%20lessons%20from%20the%20study%20of%20the%20archive%20to%20bear%20on%20big%20data%2C%20the%20contributors%20consider%20the%20broader%20implications%20of%20big%20data%27s%20large-scale%20determination%20of%20knowledge.%22%2C%22date%22%3A%222020%22%2C%22language%22%3A%22en%22%2C%22ISBN%22%3A%22978-0-262-53988-3%22%2C%22url%22%3A%22%22%2C%22collections%22%3A%5B%5D%2C%22dateModified%22%3A%222020-08-30T07%3A55%3A59Z%22%2C%22tags%22%3A%5B%7B%22tag%22%3A%22Archives%20as%20paradigm%22%7D%2C%7B%22tag%22%3A%22Data%20mining%22%7D%2C%7B%22tag%22%3A%22Data%20science%22%7D%5D%7D%7D%2C%7B%22key%22%3A%22VCQ8ZIXE%22%2C%22library%22%3A%7B%22id%22%3A2133649%7D%2C%22meta%22%3A%7B%22creatorSummary%22%3A%22Kwak%20et%20al.%22%2C%22parsedDate%22%3A%222020%22%2C%22numChildren%22%3A0%7D%2C%22bib%22%3A%22%3Cdiv%20class%3D%5C%22csl-bib-body%5C%22%20style%3D%5C%22line-height%3A%201.35%3B%20padding-left%3A%201em%3B%20text-indent%3A-1em%3B%5C%22%3E%5Cn%20%20%3Cdiv%20class%3D%5C%22csl-entry%5C%22%3EKwak%2C%20Haewoon%2C%20Jisun%20An%2C%20and%20Yong-Yeol%20Ahn.%20%26%23x201C%3BA%20Systematic%20Media%20Frame%20Analysis%20of%201.5%20Million%20New%20York%20Times%20Articles%20from%202000%20to%202017.%26%23x201D%3B%20%3Ci%3EArXiv%3A2005.01803%20%5BCs%5D%3C%5C%2Fi%3E%2C%202020.%20%3Ca%20class%3D%27zp-ItemURL%27%20href%3D%27http%3A%5C%2F%5C%2Farxiv.org%5C%2Fabs%5C%2F2005.01803%27%3Ehttp%3A%5C%2F%5C%2Farxiv.org%5C%2Fabs%5C%2F2005.01803%3C%5C%2Fa%3E.%20%3Ca%20title%3D%27Cite%20in%20RIS%20Format%27%20class%3D%27zp-CiteRIS%27%20href%3D%27https%3A%5C%2F%5C%2Fwe1s.ucsb.edu%5C%2Fwp-content%5C%2Fplugins%5C%2Fzotpress%5C%2Flib%5C%2Frequest%5C%2Frequest.cite.php%3Fapi_user_id%3D2133649%26amp%3Bitem_key%3DVCQ8ZIXE%27%3ECite%3C%5C%2Fa%3E%20%3C%5C%2Fdiv%3E%5Cn%3C%5C%2Fdiv%3E%22%2C%22data%22%3A%7B%22itemType%22%3A%22journalArticle%22%2C%22title%22%3A%22A%20Systematic%20Media%20Frame%20Analysis%20of%201.5%20Million%20New%20York%20Times%20Articles%20from%202000%20to%202017%22%2C%22creators%22%3A%5B%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Haewoon%22%2C%22lastName%22%3A%22Kwak%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Jisun%22%2C%22lastName%22%3A%22An%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Yong-Yeol%22%2C%22lastName%22%3A%22Ahn%22%7D%5D%2C%22abstractNote%22%3A%22Framing%20is%20an%20indispensable%20narrative%20device%20for%20news%20media%20because%20even%20the%20same%20facts%20may%20lead%20to%20conflicting%20understandings%20if%20deliberate%20framing%20is%20employed.%20Therefore%2C%20identifying%20media%20framing%20is%20a%20crucial%20step%20to%20understanding%20how%20news%20media%20influence%20the%20public.%20Framing%20is%2C%20however%2C%20difficult%20to%20operationalize%20and%20detect%2C%20and%20thus%20traditional%20media%20framing%20studies%20had%20to%20rely%20on%20manual%20annotation%2C%20which%20is%20challenging%20to%20scale%20up%20to%20massive%20news%20datasets.%20Here%2C%20by%20developing%20a%20media%20frame%20classifier%20that%20achieves%20state-of-the-art%20performance%2C%20we%20systematically%20analyze%20the%20media%20frames%20of%201.5%20million%20New%20York%20Times%20articles%20published%20from%202000%20to%202017.%20By%20examining%20the%20ebb%20and%20flow%20of%20media%20frames%20over%20almost%20two%20decades%2C%20we%20show%20that%20short-term%20frame%20abundance%20fluctuation%20closely%20corresponds%20to%20major%20events%2C%20while%20there%20also%20exist%20several%20long-term%20trends%2C%20such%20as%20the%20gradually%20increasing%20prevalence%20of%20the%20%60%60Cultural%20identity%27%27%20frame.%20By%20examining%20specific%20topics%20and%20sentiments%2C%20we%20identify%20characteristics%20and%20dynamics%20of%20each%20frame.%20Finally%2C%20as%20a%20case%20study%2C%20we%20delve%20into%20the%20framing%20of%20mass%20shootings%2C%20revealing%20three%20major%20framing%20patterns.%20Our%20scalable%2C%20computational%20approach%20to%20massive%20news%20datasets%20opens%20up%20new%20pathways%20for%20systematic%20media%20framing%20studies.%22%2C%22date%22%3A%222020%22%2C%22language%22%3A%22en%22%2C%22DOI%22%3A%22%22%2C%22ISSN%22%3A%22%22%2C%22url%22%3A%22http%3A%5C%2F%5C%2Farxiv.org%5C%2Fabs%5C%2F2005.01803%22%2C%22collections%22%3A%5B%5D%2C%22dateModified%22%3A%222020-08-15T23%3A13%3A36Z%22%2C%22tags%22%3A%5B%7B%22tag%22%3A%22Data%20science%22%7D%2C%7B%22tag%22%3A%22Frame%20analysis%20of%20media%22%7D%2C%7B%22tag%22%3A%22Machine%20learning%22%7D%2C%7B%22tag%22%3A%22Text%20classification%22%7D%5D%7D%7D%2C%7B%22key%22%3A%22HK5Z5SCM%22%2C%22library%22%3A%7B%22id%22%3A2133649%7D%2C%22meta%22%3A%7B%22creatorSummary%22%3A%22Munro%22%2C%22parsedDate%22%3A%222020%22%2C%22numChildren%22%3A0%7D%2C%22bib%22%3A%22%3Cdiv%20class%3D%5C%22csl-bib-body%5C%22%20style%3D%5C%22line-height%3A%201.35%3B%20padding-left%3A%201em%3B%20text-indent%3A-1em%3B%5C%22%3E%5Cn%20%20%3Cdiv%20class%3D%5C%22csl-entry%5C%22%3EMunro%2C%20Robert.%20%3Ci%3EHuman-in-the-Loop%20Machine%20Learning%3C%5C%2Fi%3E.%20Shelter%20Island%2C%20New%20York%3A%20Manning%2C%202020.%20%3Ca%20class%3D%27zp-ItemURL%27%20href%3D%27https%3A%5C%2F%5C%2Fwww.manning.com%5C%2Fbooks%5C%2Fhuman-in-the-loop-machine-learning%27%3Ehttps%3A%5C%2F%5C%2Fwww.manning.com%5C%2Fbooks%5C%2Fhuman-in-the-loop-machine-learning%3C%5C%2Fa%3E.%20%3Ca%20title%3D%27Cite%20in%20RIS%20Format%27%20class%3D%27zp-CiteRIS%27%20href%3D%27https%3A%5C%2F%5C%2Fwe1s.ucsb.edu%5C%2Fwp-content%5C%2Fplugins%5C%2Fzotpress%5C%2Flib%5C%2Frequest%5C%2Frequest.cite.php%3Fapi_user_id%3D2133649%26amp%3Bitem_key%3DHK5Z5SCM%27%3ECite%3C%5C%2Fa%3E%20%3C%5C%2Fdiv%3E%5Cn%3C%5C%2Fdiv%3E%22%2C%22data%22%3A%7B%22itemType%22%3A%22book%22%2C%22title%22%3A%22Human-in-the-Loop%20Machine%20Learning%22%2C%22creators%22%3A%5B%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Robert%22%2C%22lastName%22%3A%22Munro%22%7D%5D%2C%22abstractNote%22%3A%22Most%20machine%20learning%20systems%20that%20are%20deployed%20in%20the%20world%20today%20learn%20from%20human%20feedback.%20However%2C%20most%20machine%20learning%20courses%20focus%20almost%20exclusively%20on%20the%20algorithms%2C%20not%20the%20human-computer%20interaction%20part%20of%20the%20systems.%20This%20can%20leave%20a%20big%20knowledge%20gap%20for%20data%20scientists%20working%20in%20real-world%20machine%20learning%2C%20where%20data%20scientists%20spend%20more%20time%20on%20data%20management%20than%20on%20building%20algorithms.%20Human-in-the-Loop%20Machine%20Learning%20is%20a%20practical%20guide%20to%20optimizing%20the%20entire%20machine%20learning%20process%2C%20including%20techniques%20for%20annotation%2C%20active%20learning%2C%20transfer%20learning%2C%20and%20using%20machine%20learning%20to%20optimize%20every%20step%20of%20the%20process.%22%2C%22date%22%3A%222020%22%2C%22language%22%3A%22en%22%2C%22ISBN%22%3A%22978-1-61729-674-1%22%2C%22url%22%3A%22https%3A%5C%2F%5C%2Fwww.manning.com%5C%2Fbooks%5C%2Fhuman-in-the-loop-machine-learning%22%2C%22collections%22%3A%5B%5D%2C%22dateModified%22%3A%222020-02-08T21%3A13%3A04Z%22%2C%22tags%22%3A%5B%7B%22tag%22%3A%22Data%20science%22%7D%2C%7B%22tag%22%3A%22Interpretability%20and%20explainability%22%7D%2C%7B%22tag%22%3A%22Machine%20learning%22%7D%5D%7D%7D%2C%7B%22key%22%3A%227YCYMALQ%22%2C%22library%22%3A%7B%22id%22%3A2133649%7D%2C%22meta%22%3A%7B%22creatorSummary%22%3A%22Wang%20et%20al.%22%2C%22parsedDate%22%3A%222019%22%2C%22numChildren%22%3A0%7D%2C%22bib%22%3A%22%3Cdiv%20class%3D%5C%22csl-bib-body%5C%22%20style%3D%5C%22line-height%3A%201.35%3B%20padding-left%3A%201em%3B%20text-indent%3A-1em%3B%5C%22%3E%5Cn%20%20%3Cdiv%20class%3D%5C%22csl-entry%5C%22%3EWang%2C%20April%20Yi%2C%20Anant%20Mittal%2C%20Christopher%20Brooks%2C%20and%20Steve%20Oney.%20%26%23x201C%3BHow%20Data%20Scientists%20Use%20Computational%20Notebooks%20for%20Real-Time%20Collaboration.%26%23x201D%3B%20Association%20for%20Computing%20Machinery%2C%202019.%20%3Ca%20class%3D%27zp-ItemURL%27%20href%3D%27https%3A%5C%2F%5C%2Fdoi.org%5C%2F10.1145%5C%2F3359141%27%3Ehttps%3A%5C%2F%5C%2Fdoi.org%5C%2F10.1145%5C%2F3359141%3C%5C%2Fa%3E.%20%3Ca%20title%3D%27Cite%20in%20RIS%20Format%27%20class%3D%27zp-CiteRIS%27%20href%3D%27https%3A%5C%2F%5C%2Fwe1s.ucsb.edu%5C%2Fwp-content%5C%2Fplugins%5C%2Fzotpress%5C%2Flib%5C%2Frequest%5C%2Frequest.cite.php%3Fapi_user_id%3D2133649%26amp%3Bitem_key%3D7YCYMALQ%27%3ECite%3C%5C%2Fa%3E%20%3C%5C%2Fdiv%3E%5Cn%3C%5C%2Fdiv%3E%22%2C%22data%22%3A%7B%22itemType%22%3A%22document%22%2C%22title%22%3A%22How%20Data%20Scientists%20Use%20Computational%20Notebooks%20for%20Real-Time%20Collaboration%22%2C%22creators%22%3A%5B%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22April%20Yi%22%2C%22lastName%22%3A%22Wang%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Anant%22%2C%22lastName%22%3A%22Mittal%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Christopher%22%2C%22lastName%22%3A%22Brooks%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Steve%22%2C%22lastName%22%3A%22Oney%22%7D%5D%2C%22abstractNote%22%3A%22Effective%20collaboration%20in%20data%20science%20can%20leverage%20domain%20expertise%20from%20each%20team%20member%20and%20thus%20improve%20the%20quality%20and%20efficiency%20of%20the%20work.%20Computational%20notebooks%20give%20data%20scientists%20a%20convenient%20interactive%20solution%20for%20sharing%20and%20keeping%20track%20of%20the%20data%20exploration%20process%20through%20a%20combination%20of%20code%2C%20narrative%20text%2C%20visualizations%2C%20and%20other%20rich%20media.%20In%20this%20paper%2C%20we%20report%20how%20synchronous%20editing%20in%20computational%20notebooks%20changes%20the%20way%20data%20scientists%20work%20together%20compared%20to%20working%20on%20individual%20notebooks.%20We%20first%20conducted%20a%20formative%20survey%20with%20195%20data%20scientists%20to%20understand%20their%20past%20experience%20with%20collaboration%20in%20the%20context%20of%20data%20science.%20Next%2C%20we%20carried%20out%20an%20observational%20study%20of%2024%20data%20scientists%20working%20in%20pairs%20remotely%20to%20solve%20a%20typical%20data%20science%20predictive%20modeling%20problem%2C%20working%20on%20either%20notebooks%20supported%20by%20synchronous%20groupware%20or%20individual%20notebooks%20in%20a%20collaborative%20setting.%20The%20study%20showed%20that%20working%20on%20the%20synchronous%20notebooks%20improves%20collaboration%20by%20creating%20a%20shared%20context%2C%20encouraging%20more%20exploration%2C%20and%20reducing%20communication%20costs.%20However%2C%20the%20current%20synchronous%20editing%20features%20may%20lead%20to%20unbalanced%20participation%20and%20activity%20interference%20without%20strategic%20coordination.%20The%20synchronous%20notebooks%20may%20also%20amplify%20the%20tension%20between%20quick%20exploration%20and%20clear%20explanations.%20Building%20on%20these%20findings%2C%20we%20propose%20several%20design%20implications%20aimed%20at%20better%20supporting%20collaborative%20editing%20in%20computational%20notebooks%2C%20and%20thus%20improving%20efficiency%20in%20teamwork%20among%20data%20scientists.%22%2C%22date%22%3A%222019%22%2C%22language%22%3A%22en%22%2C%22url%22%3A%22https%3A%5C%2F%5C%2Fdoi.org%5C%2F10.1145%5C%2F3359141%22%2C%22collections%22%3A%5B%5D%2C%22dateModified%22%3A%222020-09-03T06%3A14%3A40Z%22%2C%22tags%22%3A%5B%7B%22tag%22%3A%22Data%20notebooks%22%7D%2C%7B%22tag%22%3A%22Data%20science%22%7D%5D%7D%7D%2C%7B%22key%22%3A%22DIYQUIAZ%22%2C%22library%22%3A%7B%22id%22%3A2133649%7D%2C%22meta%22%3A%7B%22creatorSummary%22%3A%22Rule%20et%20al.%22%2C%22parsedDate%22%3A%222019%22%2C%22numChildren%22%3A0%7D%2C%22bib%22%3A%22%3Cdiv%20class%3D%5C%22csl-bib-body%5C%22%20style%3D%5C%22line-height%3A%201.35%3B%20padding-left%3A%201em%3B%20text-indent%3A-1em%3B%5C%22%3E%5Cn%20%20%3Cdiv%20class%3D%5C%22csl-entry%5C%22%3ERule%2C%20Adam%2C%20Amanda%20Birmingham%2C%20Cristal%20Zuniga%2C%20Ilkay%20Altintas%2C%20Shih-Cheng%20Huang%2C%20Rob%20Knight%2C%20Niema%20Moshiri%2C%20et%20al.%20%26%23x201C%3BTen%20Simple%20Rules%20for%20Writing%20and%20Sharing%20Computational%20Analyses%20in%20Jupyter%20Notebooks.%26%23x201D%3B%20%3Ci%3EPLOS%20Computational%20Biology%3C%5C%2Fi%3E%2015%2C%20no.%207%20%282019%29%3A%20e1007007.%20%3Ca%20class%3D%27zp-DOIURL%27%20href%3D%27https%3A%5C%2F%5C%2Fdoi.org%5C%2F10.1371%5C%2Fjournal.pcbi.1007007%27%3Ehttps%3A%5C%2F%5C%2Fdoi.org%5C%2F10.1371%5C%2Fjournal.pcbi.1007007%3C%5C%2Fa%3E.%20%3Ca%20title%3D%27Cite%20in%20RIS%20Format%27%20class%3D%27zp-CiteRIS%27%20href%3D%27https%3A%5C%2F%5C%2Fwe1s.ucsb.edu%5C%2Fwp-content%5C%2Fplugins%5C%2Fzotpress%5C%2Flib%5C%2Frequest%5C%2Frequest.cite.php%3Fapi_user_id%3D2133649%26amp%3Bitem_key%3DDIYQUIAZ%27%3ECite%3C%5C%2Fa%3E%20%3C%5C%2Fdiv%3E%5Cn%3C%5C%2Fdiv%3E%22%2C%22data%22%3A%7B%22itemType%22%3A%22journalArticle%22%2C%22title%22%3A%22Ten%20simple%20rules%20for%20writing%20and%20sharing%20computational%20analyses%20in%20Jupyter%20Notebooks%22%2C%22creators%22%3A%5B%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Adam%22%2C%22lastName%22%3A%22Rule%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Amanda%22%2C%22lastName%22%3A%22Birmingham%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Cristal%22%2C%22lastName%22%3A%22Zuniga%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Ilkay%22%2C%22lastName%22%3A%22Altintas%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Shih-Cheng%22%2C%22lastName%22%3A%22Huang%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Rob%22%2C%22lastName%22%3A%22Knight%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Niema%22%2C%22lastName%22%3A%22Moshiri%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Mai%20H.%22%2C%22lastName%22%3A%22Nguyen%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Sara%20Brin%22%2C%22lastName%22%3A%22Rosenthal%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Fernando%22%2C%22lastName%22%3A%22P%5Cu00e9rez%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Peter%20W.%22%2C%22lastName%22%3A%22Rose%22%7D%5D%2C%22abstractNote%22%3A%22As%20studies%20grow%20in%20scale%20and%20complexity%2C%20it%20has%20become%20increasingly%20difficult%20to%20provide%20clear%20descriptions%20and%20open%20access%20to%20the%20methods%20and%20data%20needed%20to%20understand%20and%20reproduce%20computational%20research.%20Numerous%20papers%2C%20including%20several%20in%20the%20Ten%20Simple%20Rules%20collection%2C%20have%20highlighted%20the%20need%20for%20robust%20and%20reproducible%20analyses%20in%20computational%20research%2C%20described%20the%20difficulty%20of%20achieving%20these%20standards%2C%20and%20enumerated%20best%20practices.%20We%20aim%20to%20augment%20this%20existing%20wellspring%20of%20advice%20by%20addressing%20the%20unique%20challenges%20and%20opportunities%20that%20arise%20when%20using%20computational%20notebooks%2C%20especially%20Jupyter%20Notebooks%2C%20for%20research.%22%2C%22date%22%3A%222019%22%2C%22language%22%3A%22en%22%2C%22DOI%22%3A%2210.1371%5C%2Fjournal.pcbi.1007007%22%2C%22ISSN%22%3A%221553-7358%22%2C%22url%22%3A%22https%3A%5C%2F%5C%2Fjournals.plos.org%5C%2Fploscompbiol%5C%2Farticle%3Fid%3D10.1371%5C%2Fjournal.pcbi.1007007%22%2C%22collections%22%3A%5B%5D%2C%22dateModified%22%3A%222020-09-03T05%3A31%3A36Z%22%2C%22tags%22%3A%5B%7B%22tag%22%3A%22Data%20notebooks%22%7D%2C%7B%22tag%22%3A%22Data%20science%22%7D%2C%7B%22tag%22%3A%22Open%20science%22%7D%2C%7B%22tag%22%3A%22Reproducibility%22%7D%5D%7D%7D%2C%7B%22key%22%3A%22QJ4XZV2F%22%2C%22library%22%3A%7B%22id%22%3A2133649%7D%2C%22meta%22%3A%7B%22lastModifiedByUser%22%3A%7B%22id%22%3A22837%2C%22username%22%3A%22ayliu%22%2C%22name%22%3A%22Alan%20Liu%22%2C%22links%22%3A%7B%22alternate%22%3A%7B%22href%22%3A%22https%3A%5C%2F%5C%2Fwww.zotero.org%5C%2Fayliu%22%2C%22type%22%3A%22text%5C%2Fhtml%22%7D%7D%7D%2C%22creatorSummary%22%3A%22Pandey%22%2C%22parsedDate%22%3A%222019%22%2C%22numChildren%22%3A0%7D%2C%22bib%22%3A%22%3Cdiv%20class%3D%5C%22csl-bib-body%5C%22%20style%3D%5C%22line-height%3A%201.35%3B%20padding-left%3A%201em%3B%20text-indent%3A-1em%3B%5C%22%3E%5Cn%20%20%3Cdiv%20class%3D%5C%22csl-entry%5C%22%3EPandey%2C%20Parul.%20%3Ci%3EInterpretable%20Machine%20Learning%3C%5C%2Fi%3E%2C%202019.%20%3Ca%20class%3D%27zp-ItemURL%27%20href%3D%27https%3A%5C%2F%5C%2Ftowardsdatascience.com%5C%2Finterpretable-machine-learning-1dec0f2f3e6b%27%3Ehttps%3A%5C%2F%5C%2Ftowardsdatascience.com%5C%2Finterpretable-machine-learning-1dec0f2f3e6b%3C%5C%2Fa%3E.%20%3Ca%20title%3D%27Cite%20in%20RIS%20Format%27%20class%3D%27zp-CiteRIS%27%20href%3D%27https%3A%5C%2F%5C%2Fwe1s.ucsb.edu%5C%2Fwp-content%5C%2Fplugins%5C%2Fzotpress%5C%2Flib%5C%2Frequest%5C%2Frequest.cite.php%3Fapi_user_id%3D2133649%26amp%3Bitem_key%3DQJ4XZV2F%27%3ECite%3C%5C%2Fa%3E%20%3C%5C%2Fdiv%3E%5Cn%3C%5C%2Fdiv%3E%22%2C%22data%22%3A%7B%22itemType%22%3A%22book%22%2C%22title%22%3A%22Interpretable%20Machine%20Learning%22%2C%22creators%22%3A%5B%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Parul%22%2C%22lastName%22%3A%22Pandey%22%7D%5D%2C%22abstractNote%22%3A%22Extracting%20human%20understandable%20insights%20from%20any%20Machine%20Learning%20model%22%2C%22date%22%3A%222019%22%2C%22language%22%3A%22en%22%2C%22ISBN%22%3A%22%22%2C%22url%22%3A%22https%3A%5C%2F%5C%2Ftowardsdatascience.com%5C%2Finterpretable-machine-learning-1dec0f2f3e6b%22%2C%22collections%22%3A%5B%5D%2C%22dateModified%22%3A%222019-07-27T21%3A40%3A47Z%22%2C%22tags%22%3A%5B%7B%22tag%22%3A%22Data%20science%22%7D%2C%7B%22tag%22%3A%22Interpretability%20and%20explainability%22%7D%5D%7D%7D%2C%7B%22key%22%3A%22SWIMDFE4%22%2C%22library%22%3A%7B%22id%22%3A2133649%7D%2C%22meta%22%3A%7B%22creatorSummary%22%3A%22Wikipedia%22%2C%22parsedDate%22%3A%222019%22%2C%22numChildren%22%3A1%7D%2C%22bib%22%3A%22%3Cdiv%20class%3D%5C%22csl-bib-body%5C%22%20style%3D%5C%22line-height%3A%201.35%3B%20padding-left%3A%201em%3B%20text-indent%3A-1em%3B%5C%22%3E%5Cn%20%20%3Cdiv%20class%3D%5C%22csl-entry%5C%22%3EWikipedia.%20%3Ci%3EConfusion%20Matrix%3C%5C%2Fi%3E%2C%202019.%20%3Ca%20class%3D%27zp-ItemURL%27%20href%3D%27https%3A%5C%2F%5C%2Fen.wikipedia.org%5C%2Fw%5C%2Findex.php%3Ftitle%3DConfusion_matrix%26oldid%3D881721342%27%3Ehttps%3A%5C%2F%5C%2Fen.wikipedia.org%5C%2Fw%5C%2Findex.php%3Ftitle%3DConfusion_matrix%26oldid%3D881721342%3C%5C%2Fa%3E.%20%3Ca%20title%3D%27Cite%20in%20RIS%20Format%27%20class%3D%27zp-CiteRIS%27%20href%3D%27https%3A%5C%2F%5C%2Fwe1s.ucsb.edu%5C%2Fwp-content%5C%2Fplugins%5C%2Fzotpress%5C%2Flib%5C%2Frequest%5C%2Frequest.cite.php%3Fapi_user_id%3D2133649%26amp%3Bitem_key%3DSWIMDFE4%27%3ECite%3C%5C%2Fa%3E%20%3C%5C%2Fdiv%3E%5Cn%3C%5C%2Fdiv%3E%22%2C%22data%22%3A%7B%22itemType%22%3A%22book%22%2C%22title%22%3A%22Confusion%20matrix%22%2C%22creators%22%3A%5B%7B%22creatorType%22%3A%22author%22%2C%22name%22%3A%22Wikipedia%22%7D%5D%2C%22abstractNote%22%3A%22In%20the%20field%20of%20machine%20learning%20and%20specifically%20the%20problem%20of%20statistical%20classification%2C%20a%20confusion%20matrix%2C%20also%20known%20as%20an%20error%20matrix%2C%20is%20a%20specific%20table%20layout%20that%20allows%20visualization%20of%20the%20performance%20of%20an%20algorithm%2C%20typically%20a%20supervised%20learning%20one%20%28in%20unsupervised%20learning%20it%20is%20usually%20called%20a%20matching%20matrix%29.%20Each%20row%20of%20the%20matrix%20represents%20the%20instances%20in%20a%20predicted%20class%20while%20each%20column%20represents%20the%20instances%20in%20an%20actual%20class%20%28or%20vice%20versa%29.%20The%20name%20stems%20from%20the%20fact%20that%20it%20makes%20it%20easy%20to%20see%20if%20the%20system%20is%20confusing%20two%20classes%20%28i.e.%20commonly%20mislabeling%20one%20as%20another%29.%22%2C%22date%22%3A%222019%22%2C%22language%22%3A%22en%22%2C%22ISBN%22%3A%22%22%2C%22url%22%3A%22https%3A%5C%2F%5C%2Fen.wikipedia.org%5C%2Fw%5C%2Findex.php%3Ftitle%3DConfusion_matrix%26oldid%3D881721342%22%2C%22collections%22%3A%5B%5D%2C%22dateModified%22%3A%222019-07-27T21%3A33%3A48Z%22%2C%22tags%22%3A%5B%7B%22tag%22%3A%22Data%20science%22%7D%2C%7B%22tag%22%3A%22Statistics%22%7D%2C%7B%22tag%22%3A%22Topic%20model%20optimization%22%7D%5D%7D%7D%2C%7B%22key%22%3A%22YQ7VLT2R%22%2C%22library%22%3A%7B%22id%22%3A2133649%7D%2C%22meta%22%3A%7B%22parsedDate%22%3A%222018%22%2C%22numChildren%22%3A0%7D%2C%22bib%22%3A%22%3Cdiv%20class%3D%5C%22csl-bib-body%5C%22%20style%3D%5C%22line-height%3A%201.35%3B%20padding-left%3A%201em%3B%20text-indent%3A-1em%3B%5C%22%3E%5Cn%20%20%3Cdiv%20class%3D%5C%22csl-entry%5C%22%3E%26%23x201C%3BBig%20Data%20Technologies%3A%20A%20Survey.%26%23x201D%3B%20%3Ci%3EJournal%20of%20King%20Saud%20University%20-%20Computer%20and%20Information%20Sciences%3C%5C%2Fi%3E%2030%2C%20no.%204%20%282018%29%3A%20431%26%23x2013%3B48.%20%3Ca%20class%3D%27zp-DOIURL%27%20href%3D%27https%3A%5C%2F%5C%2Fdoi.org%5C%2F10.1016%5C%2Fj.jksuci.2017.06.001%27%3Ehttps%3A%5C%2F%5C%2Fdoi.org%5C%2F10.1016%5C%2Fj.jksuci.2017.06.001%3C%5C%2Fa%3E.%20%3Ca%20title%3D%27Cite%20in%20RIS%20Format%27%20class%3D%27zp-CiteRIS%27%20href%3D%27https%3A%5C%2F%5C%2Fwe1s.ucsb.edu%5C%2Fwp-content%5C%2Fplugins%5C%2Fzotpress%5C%2Flib%5C%2Frequest%5C%2Frequest.cite.php%3Fapi_user_id%3D2133649%26amp%3Bitem_key%3DYQ7VLT2R%27%3ECite%3C%5C%2Fa%3E%20%3C%5C%2Fdiv%3E%5Cn%3C%5C%2Fdiv%3E%22%2C%22data%22%3A%7B%22itemType%22%3A%22journalArticle%22%2C%22title%22%3A%22Big%20Data%20technologies%3A%20A%20survey%22%2C%22creators%22%3A%5B%5D%2C%22abstractNote%22%3A%22Developing%20Big%20Data%20applications%20has%20become%20increasingly%20important%20in%20the%20last%20few%20years.%20In%20fact%2C%20several%20organizations%20from%20different%20sectors%20depend%20increasingly%20on%20knowledge%20extracted%20from%20huge%20volumes%20of%20data.%20However%2C%20in%20Big%20Data%20context%2C%20traditional%20data%20techniques%20and%20platforms%20are%20less%20efficient.%20They%20show%20a%20slow%20responsiveness%20and%20lack%20of%20scalability%2C%20performance%20and%20accuracy.%20To%20face%20the%20complex%20Big%20Data%20challenges%2C%20much%20work%20has%20been%20carried%20out.%20As%20a%20result%2C%20various%20types%20of%20distributions%20and%20technologies%20have%20been%20developed.%20This%20paper%20is%20a%20review%20that%20survey%20recent%20technologies%20developed%20for%20Big%20Data.%20It%20aims%20to%20help%20to%20select%20and%20adopt%20the%20right%20combination%20of%20different%20Big%20Data%20technologies%20according%20to%20their%20technological%20needs%20and%20specific%20applications%5Cu2019%20requirements.%20It%20provides%20not%20only%20a%20global%20view%20of%20main%20Big%20Data%20technologies%20but%20also%20comparisons%20according%20to%20different%20system%20layers%20such%20as%20Data%20Storage%20Layer%2C%20Data%20Processing%20Layer%2C%20Data%20Querying%20Layer%2C%20Data%20Access%20Layer%20and%20Management%20Layer.%20It%20categorizes%20and%20discusses%20main%20technologies%20features%2C%20advantages%2C%20limits%20and%20usages.%22%2C%22date%22%3A%222018%22%2C%22language%22%3A%22en%22%2C%22DOI%22%3A%2210.1016%5C%2Fj.jksuci.2017.06.001%22%2C%22ISSN%22%3A%221319-1578%22%2C%22url%22%3A%22https%3A%5C%2F%5C%2Fwww.sciencedirect.com%5C%2Fscience%5C%2Farticle%5C%2Fpii%5C%2FS1319157817300034%22%2C%22collections%22%3A%5B%5D%2C%22dateModified%22%3A%222020-09-03T06%3A52%3A07Z%22%2C%22tags%22%3A%5B%7B%22tag%22%3A%22Big%20data%22%7D%2C%7B%22tag%22%3A%22Data%20science%22%7D%5D%7D%7D%2C%7B%22key%22%3A%22NI4HVKMD%22%2C%22library%22%3A%7B%22id%22%3A2133649%7D%2C%22meta%22%3A%7B%22creatorSummary%22%3A%22Kery%20et%20al.%22%2C%22parsedDate%22%3A%222018%22%2C%22numChildren%22%3A0%7D%2C%22bib%22%3A%22%3Cdiv%20class%3D%5C%22csl-bib-body%5C%22%20style%3D%5C%22line-height%3A%201.35%3B%20padding-left%3A%201em%3B%20text-indent%3A-1em%3B%5C%22%3E%5Cn%20%20%3Cdiv%20class%3D%5C%22csl-entry%5C%22%3EKery%2C%20Mary%20Beth%2C%20Marissa%20Radensky%2C%20Mahima%20Arya%2C%20Bonnie%20E.%20John%2C%20and%20Brad%20A.%20Myers.%20%26%23x201C%3BThe%20Story%20in%20the%20Notebook%3A%20Exploratory%20Data%20Science%20Using%20a%20Literate%20Programming%20Tool.%26%23x201D%3B%20In%20%3Ci%3EProceedings%20of%20the%202018%20CHI%20Conference%20on%20Human%20Factors%20in%20Computing%20Systems%3C%5C%2Fi%3E%2C%201%26%23x2013%3B11.%20CHI%20%26%23x2019%3B18.%20Montreal%20QC%2C%20Canada%3A%20Association%20for%20Computing%20Machinery%2C%202018.%20%3Ca%20class%3D%27zp-ItemURL%27%20href%3D%27https%3A%5C%2F%5C%2Fdoi.org%5C%2F10.1145%5C%2F3173574.3173748%27%3Ehttps%3A%5C%2F%5C%2Fdoi.org%5C%2F10.1145%5C%2F3173574.3173748%3C%5C%2Fa%3E.%20%3Ca%20title%3D%27Cite%20in%20RIS%20Format%27%20class%3D%27zp-CiteRIS%27%20href%3D%27https%3A%5C%2F%5C%2Fwe1s.ucsb.edu%5C%2Fwp-content%5C%2Fplugins%5C%2Fzotpress%5C%2Flib%5C%2Frequest%5C%2Frequest.cite.php%3Fapi_user_id%3D2133649%26amp%3Bitem_key%3DNI4HVKMD%27%3ECite%3C%5C%2Fa%3E%20%3C%5C%2Fdiv%3E%5Cn%3C%5C%2Fdiv%3E%22%2C%22data%22%3A%7B%22itemType%22%3A%22conferencePaper%22%2C%22title%22%3A%22The%20Story%20in%20the%20Notebook%3A%20Exploratory%20Data%20Science%20using%20a%20Literate%20Programming%20Tool%22%2C%22creators%22%3A%5B%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Mary%20Beth%22%2C%22lastName%22%3A%22Kery%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Marissa%22%2C%22lastName%22%3A%22Radensky%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Mahima%22%2C%22lastName%22%3A%22Arya%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Bonnie%20E.%22%2C%22lastName%22%3A%22John%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Brad%20A.%22%2C%22lastName%22%3A%22Myers%22%7D%5D%2C%22abstractNote%22%3A%22Literate%20programming%20tools%20are%20used%20by%20millions%20of%20programmers%20today%2C%20and%20are%20intended%20to%20facilitate%20presenting%20data%20analyses%20in%20the%20form%20of%20a%20narrative.%20We%20interviewed%2021%20data%20scientists%20to%20study%20coding%20behaviors%20in%20a%20literate%20programming%20environment%20and%20how%20data%20scientists%20kept%20track%20of%20variants%20they%20explored.%20For%20participants%20who%20tried%20to%20keep%20a%20detailed%20history%20of%20their%20experimentation%2C%20both%20informal%20and%20formal%20versioning%20attempts%20led%20to%20problems%2C%20such%20as%20reduced%20notebook%20readability.%20During%20iteration%2C%20participants%20actively%20curated%20their%20notebooks%20into%20narratives%2C%20although%20primarily%20through%20cell%20structure%20rather%20than%20markdown%20explanations.%20Next%2C%20we%20surveyed%2045%20data%20scientists%20and%20asked%20them%20to%20envision%20how%20they%20might%20use%20their%20past%20history%20in%20an%20future%20version%20control%20system.%20Based%20on%20these%20results%2C%20we%20give%20design%20guidance%20for%20future%20literate%20programming%20tools%2C%20such%20as%20providing%20history%20search%20based%20on%20how%20programmers%20recall%20their%20explorations%2C%20through%20contextual%20details%20including%20images%20and%20parameters.%22%2C%22date%22%3A%222018%22%2C%22proceedingsTitle%22%3A%22Proceedings%20of%20the%202018%20CHI%20Conference%20on%20Human%20Factors%20in%20Computing%20Systems%22%2C%22conferenceName%22%3A%22%22%2C%22language%22%3A%22en%22%2C%22DOI%22%3A%2210.1145%5C%2F3173574.3173748%22%2C%22ISBN%22%3A%22978-1-4503-5620-6%22%2C%22url%22%3A%22https%3A%5C%2F%5C%2Fdoi.org%5C%2F10.1145%5C%2F3173574.3173748%22%2C%22collections%22%3A%5B%5D%2C%22dateModified%22%3A%222020-09-03T05%3A33%3A10Z%22%2C%22tags%22%3A%5B%7B%22tag%22%3A%22Data%20narrative%22%7D%2C%7B%22tag%22%3A%22Data%20notebooks%22%7D%2C%7B%22tag%22%3A%22Data%20science%22%7D%5D%7D%7D%2C%7B%22key%22%3A%228KT424LE%22%2C%22library%22%3A%7B%22id%22%3A2133649%7D%2C%22meta%22%3A%7B%22creatorSummary%22%3A%22Narkhede%22%2C%22parsedDate%22%3A%222018%22%2C%22numChildren%22%3A0%7D%2C%22bib%22%3A%22%3Cdiv%20class%3D%5C%22csl-bib-body%5C%22%20style%3D%5C%22line-height%3A%201.35%3B%20padding-left%3A%201em%3B%20text-indent%3A-1em%3B%5C%22%3E%5Cn%20%20%3Cdiv%20class%3D%5C%22csl-entry%5C%22%3ENarkhede%2C%20Sarang.%20%3Ci%3EUnderstanding%20Confusion%20Matrix%3C%5C%2Fi%3E%2C%202018.%20%3Ca%20class%3D%27zp-ItemURL%27%20href%3D%27https%3A%5C%2F%5C%2Ftowardsdatascience.com%5C%2Funderstanding-confusion-matrix-a9ad42dcfd62%27%3Ehttps%3A%5C%2F%5C%2Ftowardsdatascience.com%5C%2Funderstanding-confusion-matrix-a9ad42dcfd62%3C%5C%2Fa%3E.%20%3Ca%20title%3D%27Cite%20in%20RIS%20Format%27%20class%3D%27zp-CiteRIS%27%20href%3D%27https%3A%5C%2F%5C%2Fwe1s.ucsb.edu%5C%2Fwp-content%5C%2Fplugins%5C%2Fzotpress%5C%2Flib%5C%2Frequest%5C%2Frequest.cite.php%3Fapi_user_id%3D2133649%26amp%3Bitem_key%3D8KT424LE%27%3ECite%3C%5C%2Fa%3E%20%3C%5C%2Fdiv%3E%5Cn%3C%5C%2Fdiv%3E%22%2C%22data%22%3A%7B%22itemType%22%3A%22book%22%2C%22title%22%3A%22Understanding%20Confusion%20Matrix%22%2C%22creators%22%3A%5B%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Sarang%22%2C%22lastName%22%3A%22Narkhede%22%7D%5D%2C%22abstractNote%22%3A%22When%20data%20is%20gathered%2C%20after%20data%20cleaning%2C%20pre-processing%20and%20wrangling%2C%20the%20first%20step%20is%20to%20feed%20it%20to%20an%20outstanding%20model%20and%20of%20course%2C%20get%20output%20in%20probabilities.%20But%20hold%20on%21%20How%20in%20the%20hell%20can%20one%20measure%20the%20effectiveness%20of%20their%20model%3F%20The%20better%20the%20effectiveness%2C%20the%20better%20the%20performance%2C%20and%20that%5Cu2019s%20exactly%20what%20we%20want.%20And%20it%20is%20where%20the%20Confusion%20matrix%20comes%20into%20the%20limelight.%20Confusion%20Matrix%20is%20a%20performance%20measurement%20for%20machine%20learning%20classification.%20This%20blog%20aims%20to%20answer%20following%20questions%3A%20What%20the%20confusion%20matrix%20is%20and%20why%20you%20need%20it%3F%20How%20to%20calculate%20Confusion%20Matrix%20for%20a%202-class%20classification%20problem%3F%22%2C%22date%22%3A%222018%22%2C%22language%22%3A%22en%22%2C%22ISBN%22%3A%22%22%2C%22url%22%3A%22https%3A%5C%2F%5C%2Ftowardsdatascience.com%5C%2Funderstanding-confusion-matrix-a9ad42dcfd62%22%2C%22collections%22%3A%5B%5D%2C%22dateModified%22%3A%222019-07-27T21%3A40%3A47Z%22%2C%22tags%22%3A%5B%7B%22tag%22%3A%22Data%20science%22%7D%2C%7B%22tag%22%3A%22Statistics%22%7D%2C%7B%22tag%22%3A%22Topic%20model%20optimization%22%7D%5D%7D%7D%2C%7B%22key%22%3A%229MQ3YIES%22%2C%22library%22%3A%7B%22id%22%3A2133649%7D%2C%22meta%22%3A%7B%22creatorSummary%22%3A%22Kleinman%20et%20al.%22%2C%22parsedDate%22%3A%222018%22%2C%22numChildren%22%3A0%7D%2C%22bib%22%3A%22%3Cdiv%20class%3D%5C%22csl-bib-body%5C%22%20style%3D%5C%22line-height%3A%201.35%3B%20padding-left%3A%201em%3B%20text-indent%3A-1em%3B%5C%22%3E%5Cn%20%20%3Cdiv%20class%3D%5C%22csl-entry%5C%22%3EKleinman%2C%20Scott%2C%20Mark%20D.%20LeBlanc%2C%20and%20Michael%20Drout.%20%3Ci%3EHierarchical%20Clustering%3C%5C%2Fi%3E%2C%202018.%20%3Ca%20class%3D%27zp-ItemURL%27%20href%3D%27http%3A%5C%2F%5C%2Fscalar.usc.edu%5C%2Fworks%5C%2Flexos%5C%2Fhierarchical-clustering%3Fpath%3Dmanual%27%3Ehttp%3A%5C%2F%5C%2Fscalar.usc.edu%5C%2Fworks%5C%2Flexos%5C%2Fhierarchical-clustering%3Fpath%3Dmanual%3C%5C%2Fa%3E.%20%3Ca%20title%3D%27Cite%20in%20RIS%20Format%27%20class%3D%27zp-CiteRIS%27%20href%3D%27https%3A%5C%2F%5C%2Fwe1s.ucsb.edu%5C%2Fwp-content%5C%2Fplugins%5C%2Fzotpress%5C%2Flib%5C%2Frequest%5C%2Frequest.cite.php%3Fapi_user_id%3D2133649%26amp%3Bitem_key%3D9MQ3YIES%27%3ECite%3C%5C%2Fa%3E%20%3C%5C%2Fdiv%3E%5Cn%3C%5C%2Fdiv%3E%22%2C%22data%22%3A%7B%22itemType%22%3A%22book%22%2C%22title%22%3A%22Hierarchical%20clustering%22%2C%22creators%22%3A%5B%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Scott%22%2C%22lastName%22%3A%22Kleinman%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Mark%20D.%22%2C%22lastName%22%3A%22LeBlanc%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Michael%22%2C%22lastName%22%3A%22Drout%22%7D%5D%2C%22abstractNote%22%3A%22Hierarchical%20cluster%20analysis%20is%20a%20good%20first%20choice%20when%20asking%20new%20questions%20about%20texts%2C%20this%20approach%20is%20remarkably%20versatile%20%28REF%29.%20Perhaps%20more%20than%20any%20one%20individual%20method%2C%20the%20results%20from%20the%20cluster%20analyses%20continue%20to%20generate%20new%2C%20interesting%2C%20and%20focused%20questions.%22%2C%22date%22%3A%222018%22%2C%22language%22%3A%22en%22%2C%22ISBN%22%3A%22%22%2C%22url%22%3A%22http%3A%5C%2F%5C%2Fscalar.usc.edu%5C%2Fworks%5C%2Flexos%5C%2Fhierarchical-clustering%3Fpath%3Dmanual%22%2C%22collections%22%3A%5B%5D%2C%22dateModified%22%3A%222019-07-27T21%3A33%3A45Z%22%2C%22tags%22%3A%5B%7B%22tag%22%3A%22Data%20science%22%7D%2C%7B%22tag%22%3A%22Data%20visualization%22%7D%2C%7B%22tag%22%3A%22Hierarchical%20clustering%22%7D%2C%7B%22tag%22%3A%22Topic%20clusters%22%7D%5D%7D%7D%2C%7B%22key%22%3A%22SQR5EAAL%22%2C%22library%22%3A%7B%22id%22%3A2133649%7D%2C%22meta%22%3A%7B%22creatorSummary%22%3A%22Randles%20et%20al.%22%2C%22parsedDate%22%3A%222017%22%2C%22numChildren%22%3A0%7D%2C%22bib%22%3A%22%3Cdiv%20class%3D%5C%22csl-bib-body%5C%22%20style%3D%5C%22line-height%3A%201.35%3B%20padding-left%3A%201em%3B%20text-indent%3A-1em%3B%5C%22%3E%5Cn%20%20%3Cdiv%20class%3D%5C%22csl-entry%5C%22%3ERandles%2C%20Bernadette%20M.%2C%20Irene%20V.%20Pasquetto%2C%20Milena%20S.%20Golshan%2C%20and%20Christine%20L.%20Borgman.%20%26%23x201C%3BUsing%20the%20Jupyter%20Notebook%20as%20a%20Tool%20for%20Open%20Science%3A%20An%20Empirical%20Study.%26%23x201D%3B%20In%20%3Ci%3E2017%20ACM%5C%2FIEEE%20Joint%20Conference%20on%20Digital%20Libraries%20%28JCDL%29%3C%5C%2Fi%3E%2C%201%26%23x2013%3B2%2C%202017.%20%3Ca%20class%3D%27zp-DOIURL%27%20href%3D%27https%3A%5C%2F%5C%2Fdoi.org%5C%2F10.1109%5C%2FJCDL.2017.7991618%27%3Ehttps%3A%5C%2F%5C%2Fdoi.org%5C%2F10.1109%5C%2FJCDL.2017.7991618%3C%5C%2Fa%3E.%20%3Ca%20title%3D%27Cite%20in%20RIS%20Format%27%20class%3D%27zp-CiteRIS%27%20href%3D%27https%3A%5C%2F%5C%2Fwe1s.ucsb.edu%5C%2Fwp-content%5C%2Fplugins%5C%2Fzotpress%5C%2Flib%5C%2Frequest%5C%2Frequest.cite.php%3Fapi_user_id%3D2133649%26amp%3Bitem_key%3DSQR5EAAL%27%3ECite%3C%5C%2Fa%3E%20%3C%5C%2Fdiv%3E%5Cn%3C%5C%2Fdiv%3E%22%2C%22data%22%3A%7B%22itemType%22%3A%22conferencePaper%22%2C%22title%22%3A%22Using%20the%20Jupyter%20Notebook%20as%20a%20Tool%20for%20Open%20Science%3A%20An%20Empirical%20Study%22%2C%22creators%22%3A%5B%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Bernadette%20M.%22%2C%22lastName%22%3A%22Randles%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Irene%20V.%22%2C%22lastName%22%3A%22Pasquetto%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Milena%20S.%22%2C%22lastName%22%3A%22Golshan%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Christine%20L.%22%2C%22lastName%22%3A%22Borgman%22%7D%5D%2C%22abstractNote%22%3A%22As%20scientific%20work%20becomes%20more%20computational%20and%20data-intensive%2C%20research%20processes%20and%20results%20become%20more%20difficult%20to%20interpret%20and%20reproduce.%20In%20this%20poster%2C%20we%20show%20how%20the%20Jupyter%20notebook%2C%20a%20tool%20originally%20designed%20as%20a%20free%20version%20of%20Mathematica%20notebooks%2C%20has%20evolved%20to%20become%20a%20robust%20tool%20for%20scientists%20to%20share%20code%2C%20associated%20computation%2C%20and%20documentation.%22%2C%22date%22%3A%222017%22%2C%22proceedingsTitle%22%3A%222017%20ACM%5C%2FIEEE%20Joint%20Conference%20on%20Digital%20Libraries%20%28JCDL%29%22%2C%22conferenceName%22%3A%222017%20ACM%5C%2FIEEE%20Joint%20Conference%20on%20Digital%20Libraries%20%28JCDL%29%22%2C%22language%22%3A%22en%22%2C%22DOI%22%3A%2210.1109%5C%2FJCDL.2017.7991618%22%2C%22ISBN%22%3A%22%22%2C%22url%22%3A%22%22%2C%22collections%22%3A%5B%5D%2C%22dateModified%22%3A%222020-09-03T05%3A49%3A56Z%22%2C%22tags%22%3A%5B%7B%22tag%22%3A%22Data%20notebooks%22%7D%2C%7B%22tag%22%3A%22Data%20science%22%7D%2C%7B%22tag%22%3A%22Data%20visualization%22%7D%2C%7B%22tag%22%3A%22Machine%20learning%22%7D%2C%7B%22tag%22%3A%22Open%20science%22%7D%2C%7B%22tag%22%3A%22Statistics%22%7D%5D%7D%7D%2C%7B%22key%22%3A%22TB26BFG7%22%2C%22library%22%3A%7B%22id%22%3A2133649%7D%2C%22meta%22%3A%7B%22creatorSummary%22%3A%22Jupyter%22%2C%22parsedDate%22%3A%222017%22%2C%22numChildren%22%3A0%7D%2C%22bib%22%3A%22%3Cdiv%20class%3D%5C%22csl-bib-body%5C%22%20style%3D%5C%22line-height%3A%201.35%3B%20padding-left%3A%201em%3B%20text-indent%3A-1em%3B%5C%22%3E%5Cn%20%20%3Cdiv%20class%3D%5C%22csl-entry%5C%22%3EJupyter%2C%20Project.%20%26%23x201C%3BProject%20Jupyter%3A%20Computational%20Narratives%20as%20the%20Engine%20of%20Collaborative%20Data%20Science.%26%23x201D%3B%20Medium%2C%202017.%20%3Ca%20class%3D%27zp-ItemURL%27%20href%3D%27https%3A%5C%2F%5C%2Fblog.jupyter.org%5C%2Fproject-jupyter-computational-narratives-as-the-engine-of-collaborative-data-science-2b5fb94c3c58%27%3Ehttps%3A%5C%2F%5C%2Fblog.jupyter.org%5C%2Fproject-jupyter-computational-narratives-as-the-engine-of-collaborative-data-science-2b5fb94c3c58%3C%5C%2Fa%3E.%20%3Ca%20title%3D%27Cite%20in%20RIS%20Format%27%20class%3D%27zp-CiteRIS%27%20href%3D%27https%3A%5C%2F%5C%2Fwe1s.ucsb.edu%5C%2Fwp-content%5C%2Fplugins%5C%2Fzotpress%5C%2Flib%5C%2Frequest%5C%2Frequest.cite.php%3Fapi_user_id%3D2133649%26amp%3Bitem_key%3DTB26BFG7%27%3ECite%3C%5C%2Fa%3E%20%3C%5C%2Fdiv%3E%5Cn%3C%5C%2Fdiv%3E%22%2C%22data%22%3A%7B%22itemType%22%3A%22webpage%22%2C%22title%22%3A%22Project%20Jupyter%3A%20Computational%20Narratives%20as%20the%20Engine%20of%20Collaborative%20Data%20Science%22%2C%22creators%22%3A%5B%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Project%22%2C%22lastName%22%3A%22Jupyter%22%7D%5D%2C%22abstractNote%22%3A%22This%20is%20the%20full%20text%20of%20the%20grant%20proposal%20that%20was%20funded%20by%20the%20Helmsley%20Trust%2C%20the%20Gordon%20and%20Betty%20Moore%20Foundation%20and%20the%20Alfred%20P.%20Sloan%20Foundation%20on%20April%202015%2C%20as%20described%20on%20these%20two%20announcements%20from%20UC%20Berkeley%20and%20Cal%20Poly%2C%20and%20press%20releases%20from%20the%20Helmsley%20Trust%20and%20the%20Moore%20Foundation.%22%2C%22date%22%3A%222017%22%2C%22url%22%3A%22https%3A%5C%2F%5C%2Fblog.jupyter.org%5C%2Fproject-jupyter-computational-narratives-as-the-engine-of-collaborative-data-science-2b5fb94c3c58%22%2C%22language%22%3A%22en%22%2C%22collections%22%3A%5B%5D%2C%22dateModified%22%3A%222020-09-03T05%3A29%3A05Z%22%2C%22tags%22%3A%5B%7B%22tag%22%3A%22Data%20narrative%22%7D%2C%7B%22tag%22%3A%22Data%20notebooks%22%7D%2C%7B%22tag%22%3A%22Data%20science%22%7D%5D%7D%7D%2C%7B%22key%22%3A%22TN35EMBP%22%2C%22library%22%3A%7B%22id%22%3A2133649%7D%2C%22meta%22%3A%7B%22creatorSummary%22%3A%22Ruchansky%20et%20al.%22%2C%22parsedDate%22%3A%222017%22%2C%22numChildren%22%3A1%7D%2C%22bib%22%3A%22%3Cdiv%20class%3D%5C%22csl-bib-body%5C%22%20style%3D%5C%22line-height%3A%201.35%3B%20padding-left%3A%201em%3B%20text-indent%3A-1em%3B%5C%22%3E%5Cn%20%20%3Cdiv%20class%3D%5C%22csl-entry%5C%22%3ERuchansky%2C%20Natali%2C%20Sungyong%20Seo%2C%20and%20Yan%20Liu.%20%26%23x201C%3BCSI%3A%20A%20Hybrid%20Deep%20Model%20for%20Fake%20News%20Detection.%26%23x201D%3B%20In%20%3Ci%3EProceedings%20of%20the%202017%20ACM%20on%20Conference%20on%20Information%20and%20Knowledge%20Management%3C%5C%2Fi%3E%2C%20797%26%23x2013%3B806.%20CIKM%20%26%23x2019%3B17.%20Singapore%2C%20Singapore%3A%20Association%20for%20Computing%20Machinery%2C%202017.%20%3Ca%20class%3D%27zp-ItemURL%27%20href%3D%27https%3A%5C%2F%5C%2Fdoi.org%5C%2F10.1145%5C%2F3132847.3132877%27%3Ehttps%3A%5C%2F%5C%2Fdoi.org%5C%2F10.1145%5C%2F3132847.3132877%3C%5C%2Fa%3E.%20%3Ca%20title%3D%27Cite%20in%20RIS%20Format%27%20class%3D%27zp-CiteRIS%27%20href%3D%27https%3A%5C%2F%5C%2Fwe1s.ucsb.edu%5C%2Fwp-content%5C%2Fplugins%5C%2Fzotpress%5C%2Flib%5C%2Frequest%5C%2Frequest.cite.php%3Fapi_user_id%3D2133649%26amp%3Bitem_key%3DTN35EMBP%27%3ECite%3C%5C%2Fa%3E%20%3C%5C%2Fdiv%3E%5Cn%3C%5C%2Fdiv%3E%22%2C%22data%22%3A%7B%22itemType%22%3A%22conferencePaper%22%2C%22title%22%3A%22CSI%3A%20A%20Hybrid%20Deep%20Model%20for%20Fake%20News%20Detection%22%2C%22creators%22%3A%5B%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Natali%22%2C%22lastName%22%3A%22Ruchansky%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Sungyong%22%2C%22lastName%22%3A%22Seo%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Yan%22%2C%22lastName%22%3A%22Liu%22%7D%5D%2C%22abstractNote%22%3A%22The%20topic%20of%20fake%20news%20has%20drawn%20attention%20both%20from%20the%20public%20and%20the%20academic%20communities.%20Such%20misinformation%20has%20the%20potential%20of%20affecting%20public%20opinion%2C%20providing%20an%20opportunity%20for%20malicious%20parties%20to%20manipulate%20the%20outcomes%20of%20public%20events%20such%20as%20elections.%20Because%20such%20high%20stakes%20are%20at%20play%2C%20automatically%20detecting%20fake%20news%20is%20an%20important%2C%20yet%20challenging%20problem%20that%20is%20not%20yet%20well%20understood.%20Nevertheless%2C%20there%20are%20three%20generally%20agreed%20upon%20characteristics%20of%20fake%20news%3A%20the%20text%20of%20an%20article%2C%20the%20user%20response%20it%20receives%2C%20and%20the%20source%20users%20promoting%20it.%20Existing%20work%20has%20largely%20focused%20on%20tailoring%20solutions%20to%20one%20particular%20characteristic%20which%20has%20limited%20their%20success%20and%20generality.%20In%20this%20work%2C%20we%20propose%20a%20model%20that%20combines%20all%20three%20characteristics%20for%20a%20more%20accurate%20and%20automated%20prediction.%20Specifically%2C%20we%20incorporate%20the%20behavior%20of%20both%20parties%2C%20users%20and%20articles%2C%20and%20the%20group%20behavior%20of%20users%20who%20propagate%20fake%20news.%20Motivated%20by%20the%20three%20characteristics%2C%20we%20propose%20a%20model%20called%20CSI%20which%20is%20composed%20of%20three%20modules%3A%20Capture%2C%20Score%2C%20and%20Integrate.%20The%20first%20module%20is%20based%20on%20the%20response%20and%20text%3B%20it%20uses%20a%20Recurrent%20Neural%20Network%20to%20capture%20the%20temporal%20pattern%20of%20user%20activity%20on%20a%20given%20article.%20The%20second%20module%20learns%20the%20source%20characteristic%20based%20on%20the%20behavior%20of%20users%2C%20and%20the%20two%20are%20integrated%20with%20the%20third%20module%20to%20classify%20an%20article%20as%20fake%20or%20not.%20Experimental%20analysis%20on%20real-world%20data%20demonstrates%20that%20CSI%20achieves%20higher%20accuracy%20than%20existing%20models%2C%20and%20extracts%20meaningful%20latent%20representations%20of%20both%20users%20and%20articles.%22%2C%22date%22%3A%222017%22%2C%22proceedingsTitle%22%3A%22Proceedings%20of%20the%202017%20ACM%20on%20Conference%20on%20Information%20and%20Knowledge%20Management%22%2C%22conferenceName%22%3A%22%22%2C%22language%22%3A%22en%22%2C%22DOI%22%3A%2210.1145%5C%2F3132847.3132877%22%2C%22ISBN%22%3A%22978-1-4503-4918-5%22%2C%22url%22%3A%22https%3A%5C%2F%5C%2Fdoi.org%5C%2F10.1145%5C%2F3132847.3132877%22%2C%22collections%22%3A%5B%5D%2C%22dateModified%22%3A%222020-04-01T07%3A08%3A45Z%22%2C%22tags%22%3A%5B%7B%22tag%22%3A%22Artificial%20intelligence%22%7D%2C%7B%22tag%22%3A%22Data%20science%22%7D%2C%7B%22tag%22%3A%22Fake%20news%22%7D%2C%7B%22tag%22%3A%22Journalism%22%7D%2C%7B%22tag%22%3A%22Machine%20learning%22%7D%5D%7D%7D%2C%7B%22key%22%3A%22P9D2I8BZ%22%2C%22library%22%3A%7B%22id%22%3A2133649%7D%2C%22meta%22%3A%7B%22creatorSummary%22%3A%22Wang%22%2C%22parsedDate%22%3A%222017%22%2C%22numChildren%22%3A0%7D%2C%22bib%22%3A%22%3Cdiv%20class%3D%5C%22csl-bib-body%5C%22%20style%3D%5C%22line-height%3A%201.35%3B%20padding-left%3A%201em%3B%20text-indent%3A-1em%3B%5C%22%3E%5Cn%20%20%3Cdiv%20class%3D%5C%22csl-entry%5C%22%3EWang%2C%20William%20Yang.%20%26%23x201C%3B%26%23x2018%3BLiar%2C%20Liar%20Pants%20on%20Fire%26%23x2019%3B%3A%20A%20New%20Benchmark%20Dataset%20for%20Fake%20News%20Detection.%26%23x201D%3B%20%3Ci%3EArXiv%3A1705.00648%20%5BCs%5D%3C%5C%2Fi%3E%2C%202017.%20%3Ca%20class%3D%27zp-ItemURL%27%20href%3D%27http%3A%5C%2F%5C%2Farxiv.org%5C%2Fabs%5C%2F1705.00648%27%3Ehttp%3A%5C%2F%5C%2Farxiv.org%5C%2Fabs%5C%2F1705.00648%3C%5C%2Fa%3E.%20%3Ca%20title%3D%27Cite%20in%20RIS%20Format%27%20class%3D%27zp-CiteRIS%27%20href%3D%27https%3A%5C%2F%5C%2Fwe1s.ucsb.edu%5C%2Fwp-content%5C%2Fplugins%5C%2Fzotpress%5C%2Flib%5C%2Frequest%5C%2Frequest.cite.php%3Fapi_user_id%3D2133649%26amp%3Bitem_key%3DP9D2I8BZ%27%3ECite%3C%5C%2Fa%3E%20%3C%5C%2Fdiv%3E%5Cn%3C%5C%2Fdiv%3E%22%2C%22data%22%3A%7B%22itemType%22%3A%22journalArticle%22%2C%22title%22%3A%22%5C%22Liar%2C%20Liar%20Pants%20on%20Fire%5C%22%3A%20A%20New%20Benchmark%20Dataset%20for%20Fake%20News%20Detection%22%2C%22creators%22%3A%5B%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22William%20Yang%22%2C%22lastName%22%3A%22Wang%22%7D%5D%2C%22abstractNote%22%3A%22Automatic%20fake%20news%20detection%20is%20a%20challenging%20problem%20in%20deception%20detection%2C%20and%20it%20has%20tremendous%20real-world%20political%20and%20social%20impacts.%20However%2C%20statistical%20approaches%20to%20combating%20fake%20news%20has%20been%20dramatically%20limited%20by%20the%20lack%20of%20labeled%20benchmark%20datasets.%20In%20this%20paper%2C%20we%20present%20liar%3A%20a%20new%2C%20publicly%20available%20dataset%20for%20fake%20news%20detection.%20We%20collected%20a%20decade-long%2C%2012.8K%20manually%20labeled%20short%20statements%20in%20various%20contexts%20from%20PolitiFact.com%2C%20which%20provides%20detailed%20analysis%20report%20and%20links%20to%20source%20documents%20for%20each%20case.%20This%20dataset%20can%20be%20used%20for%20fact-checking%20research%20as%20well.%20Notably%2C%20this%20new%20dataset%20is%20an%20order%20of%20magnitude%20larger%20than%20previously%20largest%20public%20fake%20news%20datasets%20of%20similar%20type.%20Empirically%2C%20we%20investigate%20automatic%20fake%20news%20detection%20based%20on%20surface-level%20linguistic%20patterns.%20We%20have%20designed%20a%20novel%2C%20hybrid%20convolutional%20neural%20network%20to%20integrate%20meta-data%20with%20text.%20We%20show%20that%20this%20hybrid%20approach%20can%20improve%20a%20text-only%20deep%20learning%20model.%22%2C%22date%22%3A%222017%22%2C%22language%22%3A%22en%22%2C%22DOI%22%3A%22%22%2C%22ISSN%22%3A%22%22%2C%22url%22%3A%22http%3A%5C%2F%5C%2Farxiv.org%5C%2Fabs%5C%2F1705.00648%22%2C%22collections%22%3A%5B%5D%2C%22dateModified%22%3A%222020-04-01T07%3A06%3A50Z%22%2C%22tags%22%3A%5B%7B%22tag%22%3A%22Artificial%20intelligence%22%7D%2C%7B%22tag%22%3A%22Data%20science%22%7D%2C%7B%22tag%22%3A%22Fake%20news%22%7D%2C%7B%22tag%22%3A%22Journalism%22%7D%2C%7B%22tag%22%3A%22Machine%20learning%22%7D%5D%7D%7D%2C%7B%22key%22%3A%227FJLHL6M%22%2C%22library%22%3A%7B%22id%22%3A2133649%7D%2C%22meta%22%3A%7B%22lastModifiedByUser%22%3A%7B%22id%22%3A22837%2C%22username%22%3A%22ayliu%22%2C%22name%22%3A%22Alan%20Liu%22%2C%22links%22%3A%7B%22alternate%22%3A%7B%22href%22%3A%22https%3A%5C%2F%5C%2Fwww.zotero.org%5C%2Fayliu%22%2C%22type%22%3A%22text%5C%2Fhtml%22%7D%7D%7D%2C%22creatorSummary%22%3A%22M%5Cu00fctzel%22%2C%22parsedDate%22%3A%222015%22%2C%22numChildren%22%3A0%7D%2C%22bib%22%3A%22%3Cdiv%20class%3D%5C%22csl-bib-body%5C%22%20style%3D%5C%22line-height%3A%201.35%3B%20padding-left%3A%201em%3B%20text-indent%3A-1em%3B%5C%22%3E%5Cn%20%20%3Cdiv%20class%3D%5C%22csl-entry%5C%22%3EM%26%23xFC%3Btzel%2C%20Sophie.%20%26%23x201C%3BFacing%20Big%20Data%3A%20Making%20Sociology%20Relevant%20%2C%20Facing%20Big%20Data%3A%20Making%20Sociology%20Relevant.%26%23x201D%3B%20%3Ci%3EBig%20Data%20%26amp%3B%20Society%3C%5C%2Fi%3E%202%2C%20no.%202%20%282015%29%3A%202053951715599179.%20%3Ca%20class%3D%27zp-ItemURL%27%20href%3D%27https%3A%5C%2F%5C%2Fdoi.org%5C%2F10.1177%5C%2F2053951715599179%27%3Ehttps%3A%5C%2F%5C%2Fdoi.org%5C%2F10.1177%5C%2F2053951715599179%3C%5C%2Fa%3E.%20%3Ca%20title%3D%27Cite%20in%20RIS%20Format%27%20class%3D%27zp-CiteRIS%27%20href%3D%27https%3A%5C%2F%5C%2Fwe1s.ucsb.edu%5C%2Fwp-content%5C%2Fplugins%5C%2Fzotpress%5C%2Flib%5C%2Frequest%5C%2Frequest.cite.php%3Fapi_user_id%3D2133649%26amp%3Bitem_key%3D7FJLHL6M%27%3ECite%3C%5C%2Fa%3E%20%3C%5C%2Fdiv%3E%5Cn%3C%5C%2Fdiv%3E%22%2C%22data%22%3A%7B%22itemType%22%3A%22journalArticle%22%2C%22title%22%3A%22Facing%20Big%20Data%3A%20Making%20sociology%20relevant%20%2C%20Facing%20Big%20Data%3A%20Making%20sociology%20relevant%22%2C%22creators%22%3A%5B%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Sophie%22%2C%22lastName%22%3A%22M%5Cu00fctzel%22%7D%5D%2C%22abstractNote%22%3A%22Working%20with%20computational%20methods%20and%20large%20textual%20analysis%20has%20been%20challenging%20and%20very%20rewarding%5Cu2014with%20all%20the%20ups%20and%20downs%20that%20doing%20empirical%20social%20research%20entails.%20In%20her%20contribution%2C%20M%5Cu00fctzel%20relates%20some%20research%20experiences%20and%20reflects%20upon%20data%20construction%20and%20the%20links%20between%20theory%2C%20data%2C%20and%20methods.%22%2C%22date%22%3A%222015%22%2C%22language%22%3A%22en%22%2C%22DOI%22%3A%2210.1177%5C%2F2053951715599179%22%2C%22ISSN%22%3A%222053-9517%22%2C%22url%22%3A%22https%3A%5C%2F%5C%2Fdoi.org%5C%2F10.1177%5C%2F2053951715599179%22%2C%22collections%22%3A%5B%5D%2C%22dateModified%22%3A%222020-09-03T06%3A39%3A26Z%22%2C%22tags%22%3A%5B%7B%22tag%22%3A%22Big%20data%22%7D%2C%7B%22tag%22%3A%22Data%20science%22%7D%2C%7B%22tag%22%3A%22Topic%20modeling%22%7D%5D%7D%7D%2C%7B%22key%22%3A%22L4PN47EA%22%2C%22library%22%3A%7B%22id%22%3A2133649%7D%2C%22meta%22%3A%7B%22creatorSummary%22%3A%22Hashem%20et%20al.%22%2C%22parsedDate%22%3A%222015%22%2C%22numChildren%22%3A0%7D%2C%22bib%22%3A%22%3Cdiv%20class%3D%5C%22csl-bib-body%5C%22%20style%3D%5C%22line-height%3A%201.35%3B%20padding-left%3A%201em%3B%20text-indent%3A-1em%3B%5C%22%3E%5Cn%20%20%3Cdiv%20class%3D%5C%22csl-entry%5C%22%3EHashem%2C%20Ibrahim%20Abaker%20Targio%2C%20Ibrar%20Yaqoob%2C%20Nor%20Badrul%20Anuar%2C%20Salimah%20Mokhtar%2C%20Abdullah%20Gani%2C%20and%20Samee%20Ullah%20Khan.%20%26%23x201C%3BThe%20Rise%20of%20%26%23x2018%3BBig%20Data%26%23x2019%3B%20on%20Cloud%20Computing%3A%20Review%20and%20Open%20Research%20Issues.%26%23x201D%3B%20%3Ci%3EInformation%20Systems%3C%5C%2Fi%3E%2047%20%282015%29%3A%2098%26%23x2013%3B115.%20%3Ca%20class%3D%27zp-DOIURL%27%20href%3D%27https%3A%5C%2F%5C%2Fdoi.org%5C%2F10.1016%5C%2Fj.is.2014.07.006%27%3Ehttps%3A%5C%2F%5C%2Fdoi.org%5C%2F10.1016%5C%2Fj.is.2014.07.006%3C%5C%2Fa%3E.%20%3Ca%20title%3D%27Cite%20in%20RIS%20Format%27%20class%3D%27zp-CiteRIS%27%20href%3D%27https%3A%5C%2F%5C%2Fwe1s.ucsb.edu%5C%2Fwp-content%5C%2Fplugins%5C%2Fzotpress%5C%2Flib%5C%2Frequest%5C%2Frequest.cite.php%3Fapi_user_id%3D2133649%26amp%3Bitem_key%3DL4PN47EA%27%3ECite%3C%5C%2Fa%3E%20%3C%5C%2Fdiv%3E%5Cn%3C%5C%2Fdiv%3E%22%2C%22data%22%3A%7B%22itemType%22%3A%22journalArticle%22%2C%22title%22%3A%22The%20rise%20of%20%5Cu201cbig%20data%5Cu201d%20on%20cloud%20computing%3A%20Review%20and%20open%20research%20issues%22%2C%22creators%22%3A%5B%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Ibrahim%20Abaker%20Targio%22%2C%22lastName%22%3A%22Hashem%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Ibrar%22%2C%22lastName%22%3A%22Yaqoob%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Nor%20Badrul%22%2C%22lastName%22%3A%22Anuar%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Salimah%22%2C%22lastName%22%3A%22Mokhtar%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Abdullah%22%2C%22lastName%22%3A%22Gani%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Samee%22%2C%22lastName%22%3A%22Ullah%20Khan%22%7D%5D%2C%22abstractNote%22%3A%22Cloud%20computing%20is%20a%20powerful%20technology%20to%20perform%20massive-scale%20and%20complex%20computing.%20It%20eliminates%20the%20need%20to%20maintain%20expensive%20computing%20hardware%2C%20dedicated%20space%2C%20and%20software.%20Massive%20growth%20in%20the%20scale%20of%20data%20or%20big%20data%20generated%20through%20cloud%20computing%20has%20been%20observed.%20Addressing%20big%20data%20is%20a%20challenging%20and%20time-demanding%20task%20that%20requires%20a%20large%20computational%20infrastructure%20to%20ensure%20successful%20data%20processing%20and%20analysis.%20The%20rise%20of%20big%20data%20in%20cloud%20computing%20is%20reviewed%20in%20this%20study.%20The%20definition%2C%20characteristics%2C%20and%20classification%20of%20big%20data%20along%20with%20some%20discussions%20on%20cloud%20computing%20are%20introduced.%20The%20relationship%20between%20big%20data%20and%20cloud%20computing%2C%20big%20data%20storage%20systems%2C%20and%20Hadoop%20technology%20are%20also%20discussed.%20Furthermore%2C%20research%20challenges%20are%20investigated%2C%20with%20focus%20on%20scalability%2C%20availability%2C%20data%20integrity%2C%20data%20transformation%2C%20data%20quality%2C%20data%20heterogeneity%2C%20privacy%2C%20legal%20and%20regulatory%20issues%2C%20and%20governance.%20Lastly%2C%20open%20research%20issues%20that%20require%20substantial%20research%20efforts%20are%20summarized.%22%2C%22date%22%3A%222015%22%2C%22language%22%3A%22en%22%2C%22DOI%22%3A%2210.1016%5C%2Fj.is.2014.07.006%22%2C%22ISSN%22%3A%220306-4379%22%2C%22url%22%3A%22http%3A%5C%2F%5C%2Fwww.sciencedirect.com%5C%2Fscience%5C%2Farticle%5C%2Fpii%5C%2FS0306437914001288%22%2C%22collections%22%3A%5B%5D%2C%22dateModified%22%3A%222020-09-03T07%3A06%3A01Z%22%2C%22tags%22%3A%5B%7B%22tag%22%3A%22Big%20data%22%7D%2C%7B%22tag%22%3A%22Data%20science%22%7D%5D%7D%7D%2C%7B%22key%22%3A%226ZQEXB72%22%2C%22library%22%3A%7B%22id%22%3A2133649%7D%2C%22meta%22%3A%7B%22parsedDate%22%3A%222015%22%2C%22numChildren%22%3A0%7D%2C%22bib%22%3A%22%3Cdiv%20class%3D%5C%22csl-bib-body%5C%22%20style%3D%5C%22line-height%3A%201.35%3B%20padding-left%3A%201em%3B%20text-indent%3A-1em%3B%5C%22%3E%5Cn%20%20%3Cdiv%20class%3D%5C%22csl-entry%5C%22%3E%26%23x201C%3BBeyond%20the%20Hype%3A%20Big%20Data%20Concepts%2C%20Methods%2C%20and%20Analytics.%26%23x201D%3B%20%3Ci%3EInternational%20Journal%20of%20Information%20Management%3C%5C%2Fi%3E%2035%2C%20no.%202%20%282015%29%3A%20137%26%23x2013%3B44.%20%3Ca%20class%3D%27zp-DOIURL%27%20href%3D%27https%3A%5C%2F%5C%2Fdoi.org%5C%2F10.1016%5C%2Fj.ijinfomgt.2014.10.007%27%3Ehttps%3A%5C%2F%5C%2Fdoi.org%5C%2F10.1016%5C%2Fj.ijinfomgt.2014.10.007%3C%5C%2Fa%3E.%20%3Ca%20title%3D%27Cite%20in%20RIS%20Format%27%20class%3D%27zp-CiteRIS%27%20href%3D%27https%3A%5C%2F%5C%2Fwe1s.ucsb.edu%5C%2Fwp-content%5C%2Fplugins%5C%2Fzotpress%5C%2Flib%5C%2Frequest%5C%2Frequest.cite.php%3Fapi_user_id%3D2133649%26amp%3Bitem_key%3D6ZQEXB72%27%3ECite%3C%5C%2Fa%3E%20%3C%5C%2Fdiv%3E%5Cn%3C%5C%2Fdiv%3E%22%2C%22data%22%3A%7B%22itemType%22%3A%22journalArticle%22%2C%22title%22%3A%22Beyond%20the%20hype%3A%20Big%20data%20concepts%2C%20methods%2C%20and%20analytics%22%2C%22creators%22%3A%5B%5D%2C%22abstractNote%22%3A%22Size%20is%20the%20first%2C%20and%20at%20times%2C%20the%20only%20dimension%20that%20leaps%20out%20at%20the%20mention%20of%20big%20data.%20This%20paper%20attempts%20to%20offer%20a%20broader%20definition%20of%20big%20data%20that%20captures%20its%20other%20unique%20and%20defining%20characteristics.%20The%20rapid%20evolution%20and%20adoption%20of%20big%20data%20by%20industry%20has%20leapfrogged%20the%20discourse%20to%20popular%20outlets%2C%20forcing%20the%20academic%20press%20to%20catch%20up.%20Academic%20journals%20in%20numerous%20disciplines%2C%20which%20will%20benefit%20from%20a%20relevant%20discussion%20of%20big%20data%2C%20have%20yet%20to%20cover%20the%20topic.%20This%20paper%20presents%20a%20consolidated%20description%20of%20big%20data%20by%20integrating%20definitions%20from%20practitioners%20and%20academics.%20The%20paper%27s%20primary%20focus%20is%20on%20the%20analytic%20methods%20used%20for%20big%20data.%20A%20particular%20distinguishing%20feature%20of%20this%20paper%20is%20its%20focus%20on%20analytics%20related%20to%20unstructured%20data%2C%20which%20constitute%2095%25%20of%20big%20data.%20This%20paper%20highlights%20the%20need%20to%20develop%20appropriate%20and%20efficient%20analytical%20methods%20to%20leverage%20massive%20volumes%20of%20heterogeneous%20data%20in%20unstructured%20text%2C%20audio%2C%20and%20video%20formats.%20This%20paper%20also%20reinforces%20the%20need%20to%20devise%20new%20tools%20for%20predictive%20analytics%20for%20structured%20big%20data.%20The%20statistical%20methods%20in%20practice%20were%20devised%20to%20infer%20from%20sample%20data.%20The%20heterogeneity%2C%20noise%2C%20and%20the%20massive%20size%20of%20structured%20big%20data%20calls%20for%20developing%20computationally%20efficient%20algorithms%20that%20may%20avoid%20big%20data%20pitfalls%2C%20such%20as%20spurious%20correlation.%22%2C%22date%22%3A%222015%22%2C%22language%22%3A%22en%22%2C%22DOI%22%3A%2210.1016%5C%2Fj.ijinfomgt.2014.10.007%22%2C%22ISSN%22%3A%220268-4012%22%2C%22url%22%3A%22https%3A%5C%2F%5C%2Fwww.sciencedirect.com%5C%2Fscience%5C%2Farticle%5C%2Fpii%5C%2FS0268401214001066%22%2C%22collections%22%3A%5B%5D%2C%22dateModified%22%3A%222020-09-03T06%3A50%3A30Z%22%2C%22tags%22%3A%5B%7B%22tag%22%3A%22Big%20data%22%7D%2C%7B%22tag%22%3A%22Data%20science%22%7D%5D%7D%7D%2C%7B%22key%22%3A%22SIMVU2YD%22%2C%22library%22%3A%7B%22id%22%3A2133649%7D%2C%22meta%22%3A%7B%22lastModifiedByUser%22%3A%7B%22id%22%3A22837%2C%22username%22%3A%22ayliu%22%2C%22name%22%3A%22Alan%20Liu%22%2C%22links%22%3A%7B%22alternate%22%3A%7B%22href%22%3A%22https%3A%5C%2F%5C%2Fwww.zotero.org%5C%2Fayliu%22%2C%22type%22%3A%22text%5C%2Fhtml%22%7D%7D%7D%2C%22creatorSummary%22%3A%22Ahonen%22%2C%22parsedDate%22%3A%222015%22%2C%22numChildren%22%3A0%7D%2C%22bib%22%3A%22%3Cdiv%20class%3D%5C%22csl-bib-body%5C%22%20style%3D%5C%22line-height%3A%201.35%3B%20padding-left%3A%201em%3B%20text-indent%3A-1em%3B%5C%22%3E%5Cn%20%20%3Cdiv%20class%3D%5C%22csl-entry%5C%22%3EAhonen%2C%20Pertti.%20%26%23x201C%3BInstitutionalizing%20Big%20Data%20Methods%20in%20Social%20and%20Political%20Research%20%2C%20Institutionalizing%20Big%20Data%20Methods%20in%20Social%20and%20Political%20Research.%26%23x201D%3B%20%3Ci%3EBig%20Data%20%26amp%3B%20Society%3C%5C%2Fi%3E%202%2C%20no.%202%20%282015%29%3A%202053951715591224.%20%3Ca%20class%3D%27zp-ItemURL%27%20href%3D%27https%3A%5C%2F%5C%2Fdoi.org%5C%2F10.1177%5C%2F2053951715591224%27%3Ehttps%3A%5C%2F%5C%2Fdoi.org%5C%2F10.1177%5C%2F2053951715591224%3C%5C%2Fa%3E.%20%3Ca%20title%3D%27Cite%20in%20RIS%20Format%27%20class%3D%27zp-CiteRIS%27%20href%3D%27https%3A%5C%2F%5C%2Fwe1s.ucsb.edu%5C%2Fwp-content%5C%2Fplugins%5C%2Fzotpress%5C%2Flib%5C%2Frequest%5C%2Frequest.cite.php%3Fapi_user_id%3D2133649%26amp%3Bitem_key%3DSIMVU2YD%27%3ECite%3C%5C%2Fa%3E%20%3C%5C%2Fdiv%3E%5Cn%3C%5C%2Fdiv%3E%22%2C%22data%22%3A%7B%22itemType%22%3A%22journalArticle%22%2C%22title%22%3A%22Institutionalizing%20Big%20Data%20methods%20in%20social%20and%20political%20research%20%2C%20Institutionalizing%20Big%20Data%20methods%20in%20social%20and%20political%20research%22%2C%22creators%22%3A%5B%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Pertti%22%2C%22lastName%22%3A%22Ahonen%22%7D%5D%2C%22abstractNote%22%3A%22This%20article%20elaborates%20conclusions%20on%20how%20Big%20Data%20methods%2C%20not%20only%20by%20means%20of%20their%20%5Cu2018social%20life%5Cu2019%20but%20also%20by%20their%20%5Cu2018political%20life%5Cu2019%2C%20may%20influence%20the%20institutionalization%20of%20social%20and%20political%20research.%20To%20reach%20its%20secondary%20objective%2C%20the%20article%20re-examines%20a%20study%20of%20budgetary%20legislation%20in%2013%20countries%20carried%20out%20by%20means%20of%20Big%20Data%20methods%20to%20draw%20conclusions%20concerning%20the%20augmentation%20of%20the%20arsenal%20of%20research%20methods%2C%20the%20surrogation%20of%20existing%20research%20designs%2C%20and%20the%20re-orientation%20of%20research.%22%2C%22date%22%3A%222015%22%2C%22language%22%3A%22en%22%2C%22DOI%22%3A%2210.1177%5C%2F2053951715591224%22%2C%22ISSN%22%3A%222053-9517%22%2C%22url%22%3A%22https%3A%5C%2F%5C%2Fdoi.org%5C%2F10.1177%5C%2F2053951715591224%22%2C%22collections%22%3A%5B%5D%2C%22dateModified%22%3A%222020-09-03T06%3A39%3A10Z%22%2C%22tags%22%3A%5B%7B%22tag%22%3A%22Big%20data%22%7D%2C%7B%22tag%22%3A%22Data%20science%22%7D%5D%7D%7D%2C%7B%22key%22%3A%22FGCXV2ID%22%2C%22library%22%3A%7B%22id%22%3A2133649%7D%2C%22meta%22%3A%7B%22creatorSummary%22%3A%22Conroy%20et%20al.%22%2C%22parsedDate%22%3A%222015%22%2C%22numChildren%22%3A1%7D%2C%22bib%22%3A%22%3Cdiv%20class%3D%5C%22csl-bib-body%5C%22%20style%3D%5C%22line-height%3A%201.35%3B%20padding-left%3A%201em%3B%20text-indent%3A-1em%3B%5C%22%3E%5Cn%20%20%3Cdiv%20class%3D%5C%22csl-entry%5C%22%3EConroy%2C%20Niall%20J.%2C%20Victoria%20L.%20Rubin%2C%20and%20Yimin%20Chen.%20%26%23x201C%3BAutomatic%20Deception%20Detection%3A%20Methods%20for%20Finding%20Fake%20News%3A%20Automatic%20Deception%20Detection%3A%20Methods%20for%20Finding%20Fake%20News.%26%23x201D%3B%20%3Ci%3EProceedings%20of%20the%20Association%20for%20Information%20Science%20and%20Technology%3C%5C%2Fi%3E%2052%2C%20no.%201%20%282015%29%3A%201%26%23x2013%3B4.%20%3Ca%20class%3D%27zp-DOIURL%27%20href%3D%27https%3A%5C%2F%5C%2Fdoi.org%5C%2F10.1002%5C%2Fpra2.2015.145052010082%27%3Ehttps%3A%5C%2F%5C%2Fdoi.org%5C%2F10.1002%5C%2Fpra2.2015.145052010082%3C%5C%2Fa%3E.%20%3Ca%20title%3D%27Cite%20in%20RIS%20Format%27%20class%3D%27zp-CiteRIS%27%20href%3D%27https%3A%5C%2F%5C%2Fwe1s.ucsb.edu%5C%2Fwp-content%5C%2Fplugins%5C%2Fzotpress%5C%2Flib%5C%2Frequest%5C%2Frequest.cite.php%3Fapi_user_id%3D2133649%26amp%3Bitem_key%3DFGCXV2ID%27%3ECite%3C%5C%2Fa%3E%20%3C%5C%2Fdiv%3E%5Cn%3C%5C%2Fdiv%3E%22%2C%22data%22%3A%7B%22itemType%22%3A%22journalArticle%22%2C%22title%22%3A%22Automatic%20deception%20detection%3A%20Methods%20for%20finding%20fake%20news%3A%20Automatic%20Deception%20Detection%3A%20Methods%20for%20Finding%20Fake%20News%22%2C%22creators%22%3A%5B%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Niall%20J.%22%2C%22lastName%22%3A%22Conroy%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Victoria%20L.%22%2C%22lastName%22%3A%22Rubin%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Yimin%22%2C%22lastName%22%3A%22Chen%22%7D%5D%2C%22abstractNote%22%3A%22This%20research%20surveys%20the%20current%20state%5Cu2010of%5Cu2010the%5Cu2010art%20technologies%20that%20are%20instrumental%20in%20the%20adoption%20and%20development%20of%20fake%20news%20detection.%20%5Cu201cFake%20news%20detection%5Cu201d%20is%20defined%20as%20the%20task%20of%20categorizing%20news%20along%20a%20continuum%20of%20veracity%2C%20with%20an%20associated%20measure%20of%20certainty.%20Veracity%20is%20compromised%20by%20the%20occurrence%20of%20intentional%20deceptions.%20The%20nature%20of%20online%20news%20publication%20has%20changed%2C%20such%20that%20traditional%20fact%20checking%20and%20vetting%20from%20potential%20deception%20is%20impossible%20against%20the%20flood%20arising%20from%20content%20generators%2C%20as%20well%20as%20various%20formats%20and%20genres.%5Cn%5CnThe%20paper%20provides%20a%20typology%20of%20several%20varieties%20of%20veracity%20assessment%20methods%20emerging%20from%20two%20major%20categories%20%5Cu2013%20linguistic%20cue%20approaches%20%28with%20machine%20learning%29%2C%20and%20network%20analysis%20approaches.%20We%20see%20promise%20in%20an%20innovative%20hybrid%20approach%20that%20combines%20linguistic%20cue%20and%20machine%20learning%2C%20with%20network%5Cu2010based%20behavioral%20data.%20Although%20designing%20a%20fake%20news%20detector%20is%20not%20a%20straightforward%20problem%2C%20we%20propose%20operational%20guidelines%20for%20a%20feasible%20fake%20news%20detecting%20system.%22%2C%22date%22%3A%222015%22%2C%22language%22%3A%22en%22%2C%22DOI%22%3A%2210.1002%5C%2Fpra2.2015.145052010082%22%2C%22ISSN%22%3A%2223739231%22%2C%22url%22%3A%22http%3A%5C%2F%5C%2Fdoi.wiley.com%5C%2F10.1002%5C%2Fpra2.2015.145052010082%22%2C%22collections%22%3A%5B%5D%2C%22dateModified%22%3A%222020-04-01T07%3A05%3A13Z%22%2C%22tags%22%3A%5B%7B%22tag%22%3A%22Data%20science%22%7D%2C%7B%22tag%22%3A%22Fake%20news%22%7D%2C%7B%22tag%22%3A%22Journalism%22%7D%2C%7B%22tag%22%3A%22Machine%20learning%22%7D%2C%7B%22tag%22%3A%22Natural%20language%20processing%22%7D%2C%7B%22tag%22%3A%22Network%20analysis%22%7D%5D%7D%7D%2C%7B%22key%22%3A%22699AFQT8%22%2C%22library%22%3A%7B%22id%22%3A2133649%7D%2C%22meta%22%3A%7B%22creatorSummary%22%3A%22Lazer%20et%20al.%22%2C%22parsedDate%22%3A%222014%22%2C%22numChildren%22%3A0%7D%2C%22bib%22%3A%22%3Cdiv%20class%3D%5C%22csl-bib-body%5C%22%20style%3D%5C%22line-height%3A%201.35%3B%20padding-left%3A%201em%3B%20text-indent%3A-1em%3B%5C%22%3E%5Cn%20%20%3Cdiv%20class%3D%5C%22csl-entry%5C%22%3ELazer%2C%20David%2C%20Ryan%20Kennedy%2C%20Gary%20King%2C%20and%20Alessandro%20Vespignani.%20%26%23x201C%3BThe%20Parable%20of%20Google%20Flu%3A%20Traps%20in%20Big%20Data%20Analysis.%26%23x201D%3B%20%3Ci%3EScience%3C%5C%2Fi%3E%20343%2C%20no.%206176%20%282014%29%3A%201203%26%23x2013%3B5.%20%3Ca%20class%3D%27zp-DOIURL%27%20href%3D%27https%3A%5C%2F%5C%2Fdoi.org%5C%2F10.1126%5C%2Fscience.1248506%27%3Ehttps%3A%5C%2F%5C%2Fdoi.org%5C%2F10.1126%5C%2Fscience.1248506%3C%5C%2Fa%3E.%20%3Ca%20title%3D%27Cite%20in%20RIS%20Format%27%20class%3D%27zp-CiteRIS%27%20href%3D%27https%3A%5C%2F%5C%2Fwe1s.ucsb.edu%5C%2Fwp-content%5C%2Fplugins%5C%2Fzotpress%5C%2Flib%5C%2Frequest%5C%2Frequest.cite.php%3Fapi_user_id%3D2133649%26amp%3Bitem_key%3D699AFQT8%27%3ECite%3C%5C%2Fa%3E%20%3C%5C%2Fdiv%3E%5Cn%3C%5C%2Fdiv%3E%22%2C%22data%22%3A%7B%22itemType%22%3A%22journalArticle%22%2C%22title%22%3A%22The%20Parable%20of%20Google%20Flu%3A%20Traps%20in%20Big%20Data%20Analysis%22%2C%22creators%22%3A%5B%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22David%22%2C%22lastName%22%3A%22Lazer%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Ryan%22%2C%22lastName%22%3A%22Kennedy%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Gary%22%2C%22lastName%22%3A%22King%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Alessandro%22%2C%22lastName%22%3A%22Vespignani%22%7D%5D%2C%22abstractNote%22%3A%22In%20February%202013%2C%20Google%20Flu%20Trends%20%28GFT%29%20made%20headlines%20but%20not%20for%20a%20reason%20that%20Google%20executives%20or%20the%20creators%20of%20the%20flu%20tracking%20system%20would%20have%20hoped.%20Nature%20reported%20that%20GFT%20was%20predicting%20more%20than%20double%20the%20proportion%20of%20doctor%20visits%20for%20influenza-like%20illness%20%28ILI%29%20than%20the%20Centers%20for%20Disease%20Control%20and%20Prevention%20%28CDC%29%2C%20which%20bases%20its%20estimates%20on%20surveillance%20reports%20from%20laboratories%20across%20the%20United%20States%20%281%2C%202%29.%20This%20happened%20despite%20the%20fact%20that%20GFT%20was%20built%20to%20predict%20CDC%20reports.%20Given%20that%20GFT%20is%20often%20held%20up%20as%20an%20exemplary%20use%20of%20big%20data%20%283%2C%204%29%2C%20what%20lessons%20can%20we%20draw%20from%20this%20error%3F%22%2C%22date%22%3A%222014%22%2C%22language%22%3A%22en%22%2C%22DOI%22%3A%2210.1126%5C%2Fscience.1248506%22%2C%22ISSN%22%3A%220036-8075%2C%201095-9203%22%2C%22url%22%3A%22https%3A%5C%2F%5C%2Fscience.sciencemag.org%5C%2Fcontent%5C%2F343%5C%2F6176%5C%2F1203%22%2C%22collections%22%3A%5B%5D%2C%22dateModified%22%3A%222020-09-03T07%3A03%3A53Z%22%2C%22tags%22%3A%5B%7B%22tag%22%3A%22Big%20data%22%7D%2C%7B%22tag%22%3A%22Data%20science%22%7D%5D%7D%7D%2C%7B%22key%22%3A%22534QXGP7%22%2C%22library%22%3A%7B%22id%22%3A2133649%7D%2C%22meta%22%3A%7B%22creatorSummary%22%3A%22Kitchin%22%2C%22parsedDate%22%3A%222014%22%2C%22numChildren%22%3A0%7D%2C%22bib%22%3A%22%3Cdiv%20class%3D%5C%22csl-bib-body%5C%22%20style%3D%5C%22line-height%3A%201.35%3B%20padding-left%3A%201em%3B%20text-indent%3A-1em%3B%5C%22%3E%5Cn%20%20%3Cdiv%20class%3D%5C%22csl-entry%5C%22%3EKitchin%2C%20Rob.%20%26%23x201C%3BBig%20Data%2C%20New%20Epistemologies%20and%20Paradigm%20Shifts.%26%23x201D%3B%20%3Ci%3EBig%20Data%20%26amp%3B%20Society%3C%5C%2Fi%3E%201%2C%20no.%201%20%282014%29%3A%202053951714528481.%20%3Ca%20class%3D%27zp-ItemURL%27%20href%3D%27https%3A%5C%2F%5C%2Fdoi.org%5C%2F10.1177%5C%2F2053951714528481%27%3Ehttps%3A%5C%2F%5C%2Fdoi.org%5C%2F10.1177%5C%2F2053951714528481%3C%5C%2Fa%3E.%20%3Ca%20title%3D%27Cite%20in%20RIS%20Format%27%20class%3D%27zp-CiteRIS%27%20href%3D%27https%3A%5C%2F%5C%2Fwe1s.ucsb.edu%5C%2Fwp-content%5C%2Fplugins%5C%2Fzotpress%5C%2Flib%5C%2Frequest%5C%2Frequest.cite.php%3Fapi_user_id%3D2133649%26amp%3Bitem_key%3D534QXGP7%27%3ECite%3C%5C%2Fa%3E%20%3C%5C%2Fdiv%3E%5Cn%3C%5C%2Fdiv%3E%22%2C%22data%22%3A%7B%22itemType%22%3A%22journalArticle%22%2C%22title%22%3A%22Big%20Data%2C%20new%20epistemologies%20and%20paradigm%20shifts%22%2C%22creators%22%3A%5B%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Rob%22%2C%22lastName%22%3A%22Kitchin%22%7D%5D%2C%22abstractNote%22%3A%22This%20article%20examines%20how%20the%20availability%20of%20Big%20Data%2C%20coupled%20with%20new%20data%20analytics%2C%20challenges%20established%20epistemologies%20across%20the%20sciences%2C%20social%20sciences%20and%20humanities%2C%20and%20assesses%20the%20extent%20to%20which%20they%20are%20engendering%20paradigm%20shifts%20across%20multiple%20disciplines.%20In%20particular%2C%20it%20critically%20explores%20new%20forms%20of%20empiricism%20that%20declare%20%5Cu2018the%20end%20of%20theory%5Cu2019%2C%20the%20creation%20of%20data-driven%20rather%20than%20knowledge-driven%20science%2C%20and%20the%20development%20of%20digital%20humanities%20and%20computational%20social%20sciences%20that%20propose%20radically%20different%20ways%20to%20make%20sense%20of%20culture%2C%20history%2C%20economy%20and%20society.%20It%20is%20argued%20that%3A%20%281%29%20Big%20Data%20and%20new%20data%20analytics%20are%20disruptive%20innovations%20which%20are%20reconfiguring%20in%20many%20instances%20how%20research%20is%20conducted%3B%20and%20%282%29%20there%20is%20an%20urgent%20need%20for%20wider%20critical%20reflection%20within%20the%20academy%20on%20the%20epistemological%20implications%20of%20the%20unfolding%20data%20revolution%2C%20a%20task%20that%20has%20barely%20begun%20to%20be%20tackled%20despite%20the%20rapid%20changes%20in%20research%20practices%20presently%20taking%20place.%20After%20critically%20reviewing%20emerging%20epistemological%20positions%2C%20it%20is%20contended%20that%20a%20potentially%20fruitful%20approach%20would%20be%20the%20development%20of%20a%20situated%2C%20reflexive%20and%20contextually%20nuanced%20epistemology.%22%2C%22date%22%3A%222014%22%2C%22language%22%3A%22en%22%2C%22DOI%22%3A%2210.1177%5C%2F2053951714528481%22%2C%22ISSN%22%3A%222053-9517%22%2C%22url%22%3A%22https%3A%5C%2F%5C%2Fdoi.org%5C%2F10.1177%5C%2F2053951714528481%22%2C%22collections%22%3A%5B%5D%2C%22dateModified%22%3A%222020-09-03T06%3A56%3A06Z%22%2C%22tags%22%3A%5B%7B%22tag%22%3A%22Big%20data%22%7D%2C%7B%22tag%22%3A%22DH%20Digital%20humanities%22%7D%2C%7B%22tag%22%3A%22Data%20science%22%7D%2C%7B%22tag%22%3A%22Humanities%22%7D%5D%7D%7D%2C%7B%22key%22%3A%22WB8S7J27%22%2C%22library%22%3A%7B%22id%22%3A2133649%7D%2C%22meta%22%3A%7B%22creatorSummary%22%3A%22Kitchin%22%2C%22parsedDate%22%3A%222014%22%2C%22numChildren%22%3A0%7D%2C%22bib%22%3A%22%3Cdiv%20class%3D%5C%22csl-bib-body%5C%22%20style%3D%5C%22line-height%3A%201.35%3B%20padding-left%3A%201em%3B%20text-indent%3A-1em%3B%5C%22%3E%5Cn%20%20%3Cdiv%20class%3D%5C%22csl-entry%5C%22%3EKitchin%2C%20Rob.%20%3Ci%3EThe%20Data%20Revolution%3A%20Big%20Data%2C%20Open%20Data%2C%20Data%20Infrastructures%20%26amp%3B%20Their%20Consequences%3C%5C%2Fi%3E.%20Los%20Angeles%2C%20California%3A%20SAGE%20Publications%2C%202014.%20%3Ca%20title%3D%27Cite%20in%20RIS%20Format%27%20class%3D%27zp-CiteRIS%27%20href%3D%27https%3A%5C%2F%5C%2Fwe1s.ucsb.edu%5C%2Fwp-content%5C%2Fplugins%5C%2Fzotpress%5C%2Flib%5C%2Frequest%5C%2Frequest.cite.php%3Fapi_user_id%3D2133649%26amp%3Bitem_key%3DWB8S7J27%27%3ECite%3C%5C%2Fa%3E%20%3C%5C%2Fdiv%3E%5Cn%3C%5C%2Fdiv%3E%22%2C%22data%22%3A%7B%22itemType%22%3A%22book%22%2C%22title%22%3A%22The%20data%20revolution%3A%20big%20data%2C%20open%20data%2C%20data%20infrastructures%20%26%20their%20consequences%22%2C%22creators%22%3A%5B%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Rob%22%2C%22lastName%22%3A%22Kitchin%22%7D%5D%2C%22abstractNote%22%3A%22Traditionally%2C%20data%20has%20been%20a%20scarce%20commodity%20which%2C%20given%20its%20value%2C%20has%20been%20either%20jealously%20guarded%20or%20expensively%20traded.%20%20In%20recent%20years%2C%20technological%20developments%20and%20political%20lobbying%20have%20turned%20this%20position%20on%20its%20head.%20Data%20now%20flow%20as%20a%20deep%20and%20wide%20torrent%2C%20are%20low%20in%20cost%20and%20supported%20by%20robust%20infrastructures%2C%20and%20are%20increasingly%20open%20and%20accessible.%20%5Cn%5CnA%20data%20revolution%20is%20underway%2C%20one%20that%20is%20already%20reshaping%20how%20knowledge%20is%20produced%2C%20business%20conducted%2C%20and%20governance%20enacted%2C%20as%20well%20as%20raising%20many%20questions%20concerning%20surveillance%2C%20privacy%2C%20security%2C%20profiling%2C%20social%20sorting%2C%20and%20intellectual%20property%20rights.%20%5Cn%5CnIn%20contrast%20to%20the%20hype%20and%20hubris%20of%20much%20media%20and%20business%20coverage%2C%20The%20Data%20Revolution%20provides%20a%20synoptic%20and%20critical%20analysis%20of%20the%20emerging%20data%20landscape.%20%20Accessible%20in%20style%2C%20the%20book%20provides%3A%5Cn%5Cn%2A%20A%20synoptic%20overview%20of%20big%20data%2C%20open%20data%20and%20data%20infrastructures%5Cn%2A%20An%20introduction%20to%20thinking%20conceptually%20about%20data%2C%20data%20infrastructures%2C%20data%20analytics%20and%20data%20markets%5Cn%2A%20A%20critical%20discussion%20of%20the%20technical%20shortcomings%20and%20the%20social%2C%20political%20and%20ethical%20consequences%20of%20the%20data%20revolution%5Cn%2A%20An%20analysis%20of%20the%20implications%20of%20the%20data%20revolution%20to%20academic%2C%20business%20and%20government%20practices%22%2C%22date%22%3A%222014%22%2C%22language%22%3A%22en%22%2C%22ISBN%22%3A%22978-1-4462-8747-7%20978-1-4462-8748-4%22%2C%22url%22%3A%22%22%2C%22collections%22%3A%5B%5D%2C%22dateModified%22%3A%222020-09-03T06%3A47%3A39Z%22%2C%22tags%22%3A%5B%7B%22tag%22%3A%22Big%20data%22%7D%2C%7B%22tag%22%3A%22Data%20science%22%7D%5D%7D%7D%2C%7B%22key%22%3A%224DL58T8C%22%2C%22library%22%3A%7B%22id%22%3A2133649%7D%2C%22meta%22%3A%7B%22creatorSummary%22%3A%22Philip%20Chen%20and%20Zhang%22%2C%22parsedDate%22%3A%222014%22%2C%22numChildren%22%3A0%7D%2C%22bib%22%3A%22%3Cdiv%20class%3D%5C%22csl-bib-body%5C%22%20style%3D%5C%22line-height%3A%201.35%3B%20padding-left%3A%201em%3B%20text-indent%3A-1em%3B%5C%22%3E%5Cn%20%20%3Cdiv%20class%3D%5C%22csl-entry%5C%22%3EPhilip%20Chen%2C%20C.%20L.%2C%20and%20Chun-Yang%20Zhang.%20%26%23x201C%3BData-Intensive%20Applications%2C%20Challenges%2C%20Techniques%20and%20Technologies%3A%20A%20Survey%20on%20Big%20Data.%26%23x201D%3B%20%3Ci%3EInformation%20Sciences%3C%5C%2Fi%3E%20275%20%282014%29%3A%20314%26%23x2013%3B47.%20%3Ca%20class%3D%27zp-DOIURL%27%20href%3D%27https%3A%5C%2F%5C%2Fdoi.org%5C%2F10.1016%5C%2Fj.ins.2014.01.015%27%3Ehttps%3A%5C%2F%5C%2Fdoi.org%5C%2F10.1016%5C%2Fj.ins.2014.01.015%3C%5C%2Fa%3E.%20%3Ca%20title%3D%27Cite%20in%20RIS%20Format%27%20class%3D%27zp-CiteRIS%27%20href%3D%27https%3A%5C%2F%5C%2Fwe1s.ucsb.edu%5C%2Fwp-content%5C%2Fplugins%5C%2Fzotpress%5C%2Flib%5C%2Frequest%5C%2Frequest.cite.php%3Fapi_user_id%3D2133649%26amp%3Bitem_key%3D4DL58T8C%27%3ECite%3C%5C%2Fa%3E%20%3C%5C%2Fdiv%3E%5Cn%3C%5C%2Fdiv%3E%22%2C%22data%22%3A%7B%22itemType%22%3A%22journalArticle%22%2C%22title%22%3A%22Data-intensive%20applications%2C%20challenges%2C%20techniques%20and%20technologies%3A%20A%20survey%20on%20Big%20Data%22%2C%22creators%22%3A%5B%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22C.%20L.%22%2C%22lastName%22%3A%22Philip%20Chen%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Chun-Yang%22%2C%22lastName%22%3A%22Zhang%22%7D%5D%2C%22abstractNote%22%3A%22It%20is%20already%20true%20that%20Big%20Data%20has%20drawn%20huge%20attention%20from%20researchers%20in%20information%20sciences%2C%20policy%20and%20decision%20makers%20in%20governments%20and%20enterprises.%20As%20the%20speed%20of%20information%20growth%20exceeds%20Moore%5Cu2019s%20Law%20at%20the%20beginning%20of%20this%20new%20century%2C%20excessive%20data%20is%20making%20great%20troubles%20to%20human%20beings.%20However%2C%20there%20are%20so%20much%20potential%20and%20highly%20useful%20values%20hidden%20in%20the%20huge%20volume%20of%20data.%20A%20new%20scientific%20paradigm%20is%20born%20as%20data-intensive%20scientific%20discovery%20%28DISD%29%2C%20also%20known%20as%20Big%20Data%20problems.%20A%20large%20number%20of%20fields%20and%20sectors%2C%20ranging%20from%20economic%20and%20business%20activities%20to%20public%20administration%2C%20from%20national%20security%20to%20scientific%20researches%20in%20many%20areas%2C%20involve%20with%20Big%20Data%20problems.%20On%20the%20one%20hand%2C%20Big%20Data%20is%20extremely%20valuable%20to%20produce%20productivity%20in%20businesses%20and%20evolutionary%20breakthroughs%20in%20scientific%20disciplines%2C%20which%20give%20us%20a%20lot%20of%20opportunities%20to%20make%20great%20progresses%20in%20many%20fields.%20There%20is%20no%20doubt%20that%20the%20future%20competitions%20in%20business%20productivity%20and%20technologies%20will%20surely%20converge%20into%20the%20Big%20Data%20explorations.%20On%20the%20other%20hand%2C%20Big%20Data%20also%20arises%20with%20many%20challenges%2C%20such%20as%20difficulties%20in%20data%20capture%2C%20data%20storage%2C%20data%20analysis%20and%20data%20visualization.%20This%20paper%20is%20aimed%20to%20demonstrate%20a%20close-up%20view%20about%20Big%20Data%2C%20including%20Big%20Data%20applications%2C%20Big%20Data%20opportunities%20and%20challenges%2C%20as%20well%20as%20the%20state-of-the-art%20techniques%20and%20technologies%20we%20currently%20adopt%20to%20deal%20with%20the%20Big%20Data%20problems.%20We%20also%20discuss%20several%20underlying%20methodologies%20to%20handle%20the%20data%20deluge%2C%20for%20example%2C%20granular%20computing%2C%20cloud%20computing%2C%20bio-inspired%20computing%2C%20and%20quantum%20computing.%22%2C%22date%22%3A%222014%22%2C%22language%22%3A%22en%22%2C%22DOI%22%3A%2210.1016%5C%2Fj.ins.2014.01.015%22%2C%22ISSN%22%3A%220020-0255%22%2C%22url%22%3A%22http%3A%5C%2F%5C%2Fwww.sciencedirect.com%5C%2Fscience%5C%2Farticle%5C%2Fpii%5C%2FS0020025514000346%22%2C%22collections%22%3A%5B%5D%2C%22dateModified%22%3A%222020-09-03T06%3A44%3A39Z%22%2C%22tags%22%3A%5B%7B%22tag%22%3A%22Big%20data%22%7D%2C%7B%22tag%22%3A%22Data%20science%22%7D%5D%7D%7D%2C%7B%22key%22%3A%224WC9XI5Z%22%2C%22library%22%3A%7B%22id%22%3A2133649%7D%2C%22meta%22%3A%7B%22lastModifiedByUser%22%3A%7B%22id%22%3A22837%2C%22username%22%3A%22ayliu%22%2C%22name%22%3A%22Alan%20Liu%22%2C%22links%22%3A%7B%22alternate%22%3A%7B%22href%22%3A%22https%3A%5C%2F%5C%2Fwww.zotero.org%5C%2Fayliu%22%2C%22type%22%3A%22text%5C%2Fhtml%22%7D%7D%7D%2C%22creatorSummary%22%3A%22Bail%22%2C%22parsedDate%22%3A%222014%22%2C%22numChildren%22%3A0%7D%2C%22bib%22%3A%22%3Cdiv%20class%3D%5C%22csl-bib-body%5C%22%20style%3D%5C%22line-height%3A%201.35%3B%20padding-left%3A%201em%3B%20text-indent%3A-1em%3B%5C%22%3E%5Cn%20%20%3Cdiv%20class%3D%5C%22csl-entry%5C%22%3EBail%2C%20Christopher.%20%3Ci%3EThe%20Cultural%20Environment%3A%20Measuring%20Culture%20With%20Big%20Data%3C%5C%2Fi%3E%2C%202014.%20%3Ca%20class%3D%27zp-ItemURL%27%20href%3D%27https%3A%5C%2F%5C%2Fwww.researchgate.net%5C%2Fpublication%5C%2F260705893_The_Cultural_Environment_Measuring_Culture_With_Big_Data%27%3Ehttps%3A%5C%2F%5C%2Fwww.researchgate.net%5C%2Fpublication%5C%2F260705893_The_Cultural_Environment_Measuring_Culture_With_Big_Data%3C%5C%2Fa%3E.%20%3Ca%20title%3D%27Cite%20in%20RIS%20Format%27%20class%3D%27zp-CiteRIS%27%20href%3D%27https%3A%5C%2F%5C%2Fwe1s.ucsb.edu%5C%2Fwp-content%5C%2Fplugins%5C%2Fzotpress%5C%2Flib%5C%2Frequest%5C%2Frequest.cite.php%3Fapi_user_id%3D2133649%26amp%3Bitem_key%3D4WC9XI5Z%27%3ECite%3C%5C%2Fa%3E%20%3C%5C%2Fdiv%3E%5Cn%3C%5C%2Fdiv%3E%22%2C%22data%22%3A%7B%22itemType%22%3A%22book%22%2C%22title%22%3A%22The%20Cultural%20Environment%3A%20Measuring%20Culture%20With%20Big%20Data%22%2C%22creators%22%3A%5B%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Christopher%22%2C%22lastName%22%3A%22Bail%22%7D%5D%2C%22abstractNote%22%3A%22This%20paper%20proposes%20asynthesis%20of%20big%20data%20and%20cultural%20sociology%20that%20adjoins%20conventional%20qualitative%20methods%20and%20newtechniques%20for%20automated%20analysis%20of%20large%20amounts%20of%20text%20in%20iterative%20fashion.%20First%2C%20it%20explains%20how%20automated%20text%20extraction%20methods%20may%20be%20used%20to%20map%20the%20contours%20of%20culturalenvironments.%20Second%2C%20it%20discusses%20the%20potential%20of%20automated%20text-classification%20methods%20toclassify%20different%20types%20of%20culture%20such%20as%20frames%2C%20schema%2C%20or%20symbolic%20boundaries.%20Finally%2C%20it%20explains%20how%20these%20new%20tools%20can%20be%20combined%20with%20conventional%20qualitative%20methods%20to%20tracethe%20evolution%20of%20such%20cultural%20elements%20over%20time.%22%2C%22date%22%3A%222014%22%2C%22language%22%3A%22en%22%2C%22ISBN%22%3A%22%22%2C%22url%22%3A%22https%3A%5C%2F%5C%2Fwww.researchgate.net%5C%2Fpublication%5C%2F260705893_The_Cultural_Environment_Measuring_Culture_With_Big_Data%22%2C%22collections%22%3A%5B%5D%2C%22dateModified%22%3A%222020-09-03T06%3A39%3A18Z%22%2C%22tags%22%3A%5B%7B%22tag%22%3A%22Big%20data%22%7D%2C%7B%22tag%22%3A%22Data%20science%22%7D%5D%7D%7D%2C%7B%22key%22%3A%228YDED8DZ%22%2C%22library%22%3A%7B%22id%22%3A2133649%7D%2C%22meta%22%3A%7B%22creatorSummary%22%3A%22Chen%20et%20al.%22%2C%22parsedDate%22%3A%222014%22%2C%22numChildren%22%3A0%7D%2C%22bib%22%3A%22%3Cdiv%20class%3D%5C%22csl-bib-body%5C%22%20style%3D%5C%22line-height%3A%201.35%3B%20padding-left%3A%201em%3B%20text-indent%3A-1em%3B%5C%22%3E%5Cn%20%20%3Cdiv%20class%3D%5C%22csl-entry%5C%22%3EChen%2C%20Min%2C%20Shiwen%20Mao%2C%20and%20Yunhao%20Liu.%20%26%23x201C%3BBig%20Data%3A%20A%20Survey.%26%23x201D%3B%20%3Ci%3EMobile%20Networks%20and%20Applications%3C%5C%2Fi%3E%2019%2C%20no.%202%20%282014%29%3A%20171%26%23x2013%3B209.%20%3Ca%20class%3D%27zp-ItemURL%27%20href%3D%27https%3A%5C%2F%5C%2Fdoi.org%5C%2F10.1007%5C%2Fs11036-013-0489-0%27%3Ehttps%3A%5C%2F%5C%2Fdoi.org%5C%2F10.1007%5C%2Fs11036-013-0489-0%3C%5C%2Fa%3E.%20%3Ca%20title%3D%27Cite%20in%20RIS%20Format%27%20class%3D%27zp-CiteRIS%27%20href%3D%27https%3A%5C%2F%5C%2Fwe1s.ucsb.edu%5C%2Fwp-content%5C%2Fplugins%5C%2Fzotpress%5C%2Flib%5C%2Frequest%5C%2Frequest.cite.php%3Fapi_user_id%3D2133649%26amp%3Bitem_key%3D8YDED8DZ%27%3ECite%3C%5C%2Fa%3E%20%3C%5C%2Fdiv%3E%5Cn%3C%5C%2Fdiv%3E%22%2C%22data%22%3A%7B%22itemType%22%3A%22journalArticle%22%2C%22title%22%3A%22Big%20Data%3A%20A%20Survey%22%2C%22creators%22%3A%5B%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Min%22%2C%22lastName%22%3A%22Chen%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Shiwen%22%2C%22lastName%22%3A%22Mao%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Yunhao%22%2C%22lastName%22%3A%22Liu%22%7D%5D%2C%22abstractNote%22%3A%22In%20this%20paper%2C%20we%20review%20the%20background%20and%20state-of-the-art%20of%20big%20data.%20We%20first%20introduce%20the%20general%20background%20of%20big%20data%20and%20review%20related%20technologies%2C%20such%20as%20could%20computing%2C%20Internet%20of%20Things%2C%20data%20centers%2C%20and%20Hadoop.%20We%20then%20focus%20on%20the%20four%20phases%20of%20the%20value%20chain%20of%20big%20data%2C%20i.e.%2C%20data%20generation%2C%20data%20acquisition%2C%20data%20storage%2C%20and%20data%20analysis.%20For%20each%20phase%2C%20we%20introduce%20the%20general%20background%2C%20discuss%20the%20technical%20challenges%2C%20and%20review%20the%20latest%20advances.%20We%20finally%20examine%20the%20several%20representative%20applications%20of%20big%20data%2C%20including%20enterprise%20management%2C%20Internet%20of%20Things%2C%20online%20social%20networks%2C%20medial%20applications%2C%20collective%20intelligence%2C%20and%20smart%20grid.%20These%20discussions%20aim%20to%20provide%20a%20comprehensive%20overview%20and%20big-picture%20to%20readers%20of%20this%20exciting%20area.%20This%20survey%20is%20concluded%20with%20a%20discussion%20of%20open%20problems%20and%20future%20directions.%22%2C%22date%22%3A%222014%22%2C%22language%22%3A%22en%22%2C%22DOI%22%3A%2210.1007%5C%2Fs11036-013-0489-0%22%2C%22ISSN%22%3A%221572-8153%22%2C%22url%22%3A%22https%3A%5C%2F%5C%2Fdoi.org%5C%2F10.1007%5C%2Fs11036-013-0489-0%22%2C%22collections%22%3A%5B%5D%2C%22dateModified%22%3A%222020-09-03T06%3A38%3A57Z%22%2C%22tags%22%3A%5B%7B%22tag%22%3A%22Big%20data%22%7D%2C%7B%22tag%22%3A%22Data%20mining%22%7D%2C%7B%22tag%22%3A%22Data%20science%22%7D%5D%7D%7D%2C%7B%22key%22%3A%22QLHMXFVQ%22%2C%22library%22%3A%7B%22id%22%3A2133649%7D%2C%22meta%22%3A%7B%22creatorSummary%22%3A%22Wu%20et%20al.%22%2C%22parsedDate%22%3A%222014%22%2C%22numChildren%22%3A0%7D%2C%22bib%22%3A%22%3Cdiv%20class%3D%5C%22csl-bib-body%5C%22%20style%3D%5C%22line-height%3A%201.35%3B%20padding-left%3A%201em%3B%20text-indent%3A-1em%3B%5C%22%3E%5Cn%20%20%3Cdiv%20class%3D%5C%22csl-entry%5C%22%3EWu%2C%20Xindong%2C%20Xingquan%20Zhu%2C%20Gong-Qing%20Wu%2C%20and%20Wei%20Ding.%20%26%23x201C%3BData%20Mining%20with%20Big%20Data.%26%23x201D%3B%20%3Ci%3EIEEE%20Transactions%20on%20Knowledge%20and%20Data%20Engineering%3C%5C%2Fi%3E%2026%2C%20no.%201%20%282014%29%3A%2097%26%23x2013%3B107.%20%3Ca%20class%3D%27zp-DOIURL%27%20href%3D%27https%3A%5C%2F%5C%2Fdoi.org%5C%2F10.1109%5C%2FTKDE.2013.109%27%3Ehttps%3A%5C%2F%5C%2Fdoi.org%5C%2F10.1109%5C%2FTKDE.2013.109%3C%5C%2Fa%3E.%20%3Ca%20title%3D%27Cite%20in%20RIS%20Format%27%20class%3D%27zp-CiteRIS%27%20href%3D%27https%3A%5C%2F%5C%2Fwe1s.ucsb.edu%5C%2Fwp-content%5C%2Fplugins%5C%2Fzotpress%5C%2Flib%5C%2Frequest%5C%2Frequest.cite.php%3Fapi_user_id%3D2133649%26amp%3Bitem_key%3DQLHMXFVQ%27%3ECite%3C%5C%2Fa%3E%20%3C%5C%2Fdiv%3E%5Cn%3C%5C%2Fdiv%3E%22%2C%22data%22%3A%7B%22itemType%22%3A%22journalArticle%22%2C%22title%22%3A%22Data%20mining%20with%20big%20data%22%2C%22creators%22%3A%5B%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Xindong%22%2C%22lastName%22%3A%22Wu%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Xingquan%22%2C%22lastName%22%3A%22Zhu%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Gong-Qing%22%2C%22lastName%22%3A%22Wu%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Wei%22%2C%22lastName%22%3A%22Ding%22%7D%5D%2C%22abstractNote%22%3A%22Big%20Data%20concern%20large-volume%2C%20complex%2C%20growing%20data%20sets%20with%20multiple%2C%20autonomous%20sources.%20With%20the%20fast%20development%20of%20networking%2C%20data%20storage%2C%20and%20the%20data%20collection%20capacity%2C%20Big%20Data%20are%20now%20rapidly%20expanding%20in%20all%20science%20and%20engineering%20domains%2C%20including%20physical%2C%20biological%20and%20biomedical%20sciences.%20This%20paper%20presents%20a%20HACE%20theorem%20that%20characterizes%20the%20features%20of%20the%20Big%20Data%20revolution%2C%20and%20proposes%20a%20Big%20Data%20processing%20model%2C%20from%20the%20data%20mining%20perspective.%20This%20data-driven%20model%20involves%20demand-driven%20aggregation%20of%20information%20sources%2C%20mining%20and%20analysis%2C%20user%20interest%20modeling%2C%20and%20security%20and%20privacy%20considerations.%20We%20analyze%20the%20challenging%20issues%20in%20the%20data-driven%20model%20and%20also%20in%20the%20Big%20Data%20revolution.%22%2C%22date%22%3A%222014%22%2C%22language%22%3A%22en%22%2C%22DOI%22%3A%2210.1109%5C%2FTKDE.2013.109%22%2C%22ISSN%22%3A%221558-2191%22%2C%22url%22%3A%22%22%2C%22collections%22%3A%5B%5D%2C%22dateModified%22%3A%222020-09-03T06%3A35%3A06Z%22%2C%22tags%22%3A%5B%7B%22tag%22%3A%22Big%20data%22%7D%2C%7B%22tag%22%3A%22Data%20science%22%7D%5D%7D%7D%2C%7B%22key%22%3A%2245LNNE27%22%2C%22library%22%3A%7B%22id%22%3A2133649%7D%2C%22meta%22%3A%7B%22creatorSummary%22%3A%22Burscher%20et%20al.%22%2C%22parsedDate%22%3A%222014%22%2C%22numChildren%22%3A0%7D%2C%22bib%22%3A%22%3Cdiv%20class%3D%5C%22csl-bib-body%5C%22%20style%3D%5C%22line-height%3A%201.35%3B%20padding-left%3A%201em%3B%20text-indent%3A-1em%3B%5C%22%3E%5Cn%20%20%3Cdiv%20class%3D%5C%22csl-entry%5C%22%3EBurscher%2C%20Bj%26%23xF6%3Brn%2C%20Daan%20Odijk%2C%20Rens%20Vliegenthart%2C%20Maarten%20de%20Rijke%2C%20and%20Claes%20H.%20de%20Vreese.%20%26%23x201C%3BTeaching%20the%20Computer%20to%20Code%20Frames%20in%20News%3A%20Comparing%20Two%20Supervised%20Machine%20Learning%20Approaches%20to%20Frame%20Analysis.%26%23x201D%3B%20%3Ci%3ECommunication%20Methods%20and%20Measures%3C%5C%2Fi%3E%208%2C%20no.%203%20%282014%29%3A%20190%26%23x2013%3B206.%20%3Ca%20class%3D%27zp-ItemURL%27%20href%3D%27https%3A%5C%2F%5C%2Fdoi.org%5C%2F10.1080%5C%2F19312458.2014.937527%27%3Ehttps%3A%5C%2F%5C%2Fdoi.org%5C%2F10.1080%5C%2F19312458.2014.937527%3C%5C%2Fa%3E.%20%3Ca%20title%3D%27Cite%20in%20RIS%20Format%27%20class%3D%27zp-CiteRIS%27%20href%3D%27https%3A%5C%2F%5C%2Fwe1s.ucsb.edu%5C%2Fwp-content%5C%2Fplugins%5C%2Fzotpress%5C%2Flib%5C%2Frequest%5C%2Frequest.cite.php%3Fapi_user_id%3D2133649%26amp%3Bitem_key%3D45LNNE27%27%3ECite%3C%5C%2Fa%3E%20%3C%5C%2Fdiv%3E%5Cn%3C%5C%2Fdiv%3E%22%2C%22data%22%3A%7B%22itemType%22%3A%22journalArticle%22%2C%22title%22%3A%22Teaching%20the%20Computer%20to%20Code%20Frames%20in%20News%3A%20Comparing%20Two%20Supervised%20Machine%20Learning%20Approaches%20to%20Frame%20Analysis%22%2C%22creators%22%3A%5B%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Bj%5Cu00f6rn%22%2C%22lastName%22%3A%22Burscher%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Daan%22%2C%22lastName%22%3A%22Odijk%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Rens%22%2C%22lastName%22%3A%22Vliegenthart%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Maarten%20de%22%2C%22lastName%22%3A%22Rijke%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Claes%20H.%20de%22%2C%22lastName%22%3A%22Vreese%22%7D%5D%2C%22abstractNote%22%3A%22We%20explore%20the%20application%20of%20supervised%20machine%20learning%20%28SML%29%20to%20frame%20coding.%20By%20automating%20the%20coding%20of%20frames%20in%20news%2C%20SML%20facilitates%20the%20incorporation%20of%20large-scale%20content%20analysis%20into%20framing%20research%2C%20even%20if%20financial%20resources%20are%20scarce.%20This%20furthers%20a%20more%20integrated%20investigation%20of%20framing%20processes%20conceptually%20as%20well%20as%20methodologically.%20We%20conduct%20several%20experiments%20in%20which%20we%20automate%20the%20coding%20of%20four%20generic%20frames%20that%20are%20operationalised%20as%20a%20set%20of%20indicator%20questions.%20In%20doing%20so%2C%20we%20compare%20two%20approaches%20to%20modelling%20the%20coherence%20between%20indicator%20questions%20and%20frames%20as%20an%20SML%20task.%20The%20results%20of%20our%20experiments%20show%20that%20SML%20is%20well%20suited%20to%20automate%20frame%20coding%20but%20that%20coding%20performance%20is%20dependent%20on%20the%20way%20SML%20is%20implemented.%22%2C%22date%22%3A%222014%22%2C%22language%22%3A%22en%22%2C%22DOI%22%3A%2210.1080%5C%2F19312458.2014.937527%22%2C%22ISSN%22%3A%221931-2458%22%2C%22url%22%3A%22https%3A%5C%2F%5C%2Fdoi.org%5C%2F10.1080%5C%2F19312458.2014.937527%22%2C%22collections%22%3A%5B%5D%2C%22dateModified%22%3A%222020-08-15T22%3A28%3A57Z%22%2C%22tags%22%3A%5B%7B%22tag%22%3A%22Data%20science%22%7D%2C%7B%22tag%22%3A%22Frame%20analysis%20of%20media%22%7D%2C%7B%22tag%22%3A%22Machine%20learning%22%7D%5D%7D%7D%2C%7B%22key%22%3A%226RASPP9X%22%2C%22library%22%3A%7B%22id%22%3A2133649%7D%2C%22meta%22%3A%7B%22creatorSummary%22%3A%22Richards%20and%20King%22%2C%22parsedDate%22%3A%222013%22%2C%22numChildren%22%3A0%7D%2C%22bib%22%3A%22%3Cdiv%20class%3D%5C%22csl-bib-body%5C%22%20style%3D%5C%22line-height%3A%201.35%3B%20padding-left%3A%201em%3B%20text-indent%3A-1em%3B%5C%22%3E%5Cn%20%20%3Cdiv%20class%3D%5C%22csl-entry%5C%22%3ERichards%2C%20Neil%20M.%2C%20and%20Jonathan%20H.%20King.%20%26%23x201C%3BThree%20Paradoxes%20of%20Big%20Data.%26%23x201D%3B%20%3Ci%3EStanford%20Law%20Review%3C%5C%2Fi%3E%2066%20%282013%29.%20%3Ca%20class%3D%27zp-ItemURL%27%20href%3D%27https%3A%5C%2F%5C%2Fwww.stanfordlawreview.org%5C%2Fonline%5C%2Fprivacy-and-big-data-three-paradoxes-of-big-data%5C%2F%27%3Ehttps%3A%5C%2F%5C%2Fwww.stanfordlawreview.org%5C%2Fonline%5C%2Fprivacy-and-big-data-three-paradoxes-of-big-data%5C%2F%3C%5C%2Fa%3E.%20%3Ca%20title%3D%27Cite%20in%20RIS%20Format%27%20class%3D%27zp-CiteRIS%27%20href%3D%27https%3A%5C%2F%5C%2Fwe1s.ucsb.edu%5C%2Fwp-content%5C%2Fplugins%5C%2Fzotpress%5C%2Flib%5C%2Frequest%5C%2Frequest.cite.php%3Fapi_user_id%3D2133649%26amp%3Bitem_key%3D6RASPP9X%27%3ECite%3C%5C%2Fa%3E%20%3C%5C%2Fdiv%3E%5Cn%3C%5C%2Fdiv%3E%22%2C%22data%22%3A%7B%22itemType%22%3A%22journalArticle%22%2C%22title%22%3A%22Three%20Paradoxes%20of%20Big%20Data%22%2C%22creators%22%3A%5B%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Neil%20M.%22%2C%22lastName%22%3A%22Richards%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Jonathan%20H.%22%2C%22lastName%22%3A%22King%22%7D%5D%2C%22abstractNote%22%3A%22%5BThird%20paragraph%3A%5D%20We%20don%5Cu2019t%20deny%20that%20big%20data%20holds%20substantial%20potential%20for%20the%20future%2C%20and%20that%20large%20dataset%20analysis%20has%20important%20uses%20today.%20But%20we%20would%20like%20to%20sound%20a%20cautionary%20note%20and%20pause%20to%20consider%20big%20data%5Cu2019s%20potential%20more%20critically.%20In%20particular%2C%20we%20want%20to%20highlight%20three%20paradoxes%20in%20the%20current%20rhetoric%20about%20big%20data%20to%20help%20move%20us%20toward%20a%20more%20complete%20understanding%20of%20the%20big%20data%20picture.%20First%2C%20while%20big%20data%20pervasively%20collects%20all%20manner%20of%20private%20information%2C%20the%20operations%20of%20big%20data%20itself%20are%20almost%20entirely%20shrouded%20in%20legal%20and%20commercial%20secrecy.%20We%20call%20this%20the%20Transparency%20Paradox.%20Second%2C%20though%20big%20data%20evangelists%20talk%20in%20terms%20of%20miraculous%20outcomes%2C%20this%20rhetoric%20ignores%20the%20fact%20that%20big%20data%20seeks%20to%20identify%20at%20the%20expense%20of%20individual%20and%20collective%20identity.%20We%20call%20this%20theIdentity%20Paradox.%20And%20third%2C%20the%20rhetoric%20of%20big%20data%20is%20characterized%20by%20its%20power%20to%20transform%20society%2C%20but%20big%20data%20has%20power%20effects%20of%20its%20own%2C%20which%20privilege%20large%20government%20and%20corporate%20entities%20at%20the%20expense%20of%20ordinary%20individuals.%20We%20call%20this%20the%20Power%20Paradox.%20Recognizing%20the%20paradoxes%20of%20big%20data%2C%20which%20show%20its%20perils%20alongside%20its%20potential%2C%20will%20help%20us%20to%20better%20understand%20this%20revolution.%20It%20may%20also%20allow%20us%20to%20craft%20solutions%20to%20produce%20a%20revolution%20that%20will%20be%20as%20good%20as%20its%20evangelists%20predict.%22%2C%22date%22%3A%222013%22%2C%22language%22%3A%22en%22%2C%22DOI%22%3A%22%22%2C%22ISSN%22%3A%22%22%2C%22url%22%3A%22https%3A%5C%2F%5C%2Fwww.stanfordlawreview.org%5C%2Fonline%5C%2Fprivacy-and-big-data-three-paradoxes-of-big-data%5C%2F%22%2C%22collections%22%3A%5B%5D%2C%22dateModified%22%3A%222020-09-03T07%3A13%3A07Z%22%2C%22tags%22%3A%5B%7B%22tag%22%3A%22Big%20data%22%7D%2C%7B%22tag%22%3A%22Data%20science%22%7D%5D%7D%7D%2C%7B%22key%22%3A%22YW57XMIT%22%2C%22library%22%3A%7B%22id%22%3A2133649%7D%2C%22meta%22%3A%7B%22creatorSummary%22%3A%22Ward%20and%20Barker%22%2C%22parsedDate%22%3A%222013%22%2C%22numChildren%22%3A0%7D%2C%22bib%22%3A%22%3Cdiv%20class%3D%5C%22csl-bib-body%5C%22%20style%3D%5C%22line-height%3A%201.35%3B%20padding-left%3A%201em%3B%20text-indent%3A-1em%3B%5C%22%3E%5Cn%20%20%3Cdiv%20class%3D%5C%22csl-entry%5C%22%3EWard%2C%20Jonathan%20Stuart%2C%20and%20Adam%20Barker.%20%26%23x201C%3BUndefined%20By%20Data%3A%20A%20Survey%20of%20Big%20Data%20Definitions.%26%23x201D%3B%20%3Ci%3EArXiv%3A1309.5821%20%5BCs%5D%3C%5C%2Fi%3E%2C%202013.%20%3Ca%20class%3D%27zp-ItemURL%27%20href%3D%27http%3A%5C%2F%5C%2Farxiv.org%5C%2Fabs%5C%2F1309.5821%27%3Ehttp%3A%5C%2F%5C%2Farxiv.org%5C%2Fabs%5C%2F1309.5821%3C%5C%2Fa%3E.%20%3Ca%20title%3D%27Cite%20in%20RIS%20Format%27%20class%3D%27zp-CiteRIS%27%20href%3D%27https%3A%5C%2F%5C%2Fwe1s.ucsb.edu%5C%2Fwp-content%5C%2Fplugins%5C%2Fzotpress%5C%2Flib%5C%2Frequest%5C%2Frequest.cite.php%3Fapi_user_id%3D2133649%26amp%3Bitem_key%3DYW57XMIT%27%3ECite%3C%5C%2Fa%3E%20%3C%5C%2Fdiv%3E%5Cn%3C%5C%2Fdiv%3E%22%2C%22data%22%3A%7B%22itemType%22%3A%22journalArticle%22%2C%22title%22%3A%22Undefined%20By%20Data%3A%20A%20Survey%20of%20Big%20Data%20Definitions%22%2C%22creators%22%3A%5B%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Jonathan%20Stuart%22%2C%22lastName%22%3A%22Ward%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Adam%22%2C%22lastName%22%3A%22Barker%22%7D%5D%2C%22abstractNote%22%3A%22The%20term%20big%20data%20has%20become%20ubiquitous.%20Owing%20to%20a%20shared%20origin%20between%20academia%2C%20industry%20and%20the%20media%20there%20is%20no%20single%20unified%20definition%2C%20and%20various%20stakeholders%20provide%20diverse%20and%20often%20contradictory%20definitions.%20The%20lack%20of%20a%20consistent%20definition%20introduces%20ambiguity%20and%20hampers%20discourse%20relating%20to%20big%20data.%20This%20short%20paper%20attempts%20to%20collate%20the%20various%20definitions%20which%20have%20gained%20some%20degree%20of%20traction%20and%20to%20furnish%20a%20clear%20and%20concise%20definition%20of%20an%20otherwise%20ambiguous%20term.%22%2C%22date%22%3A%222013%22%2C%22language%22%3A%22en%22%2C%22DOI%22%3A%22%22%2C%22ISSN%22%3A%22%22%2C%22url%22%3A%22http%3A%5C%2F%5C%2Farxiv.org%5C%2Fabs%5C%2F1309.5821%22%2C%22collections%22%3A%5B%5D%2C%22dateModified%22%3A%222020-09-03T06%3A54%3A10Z%22%2C%22tags%22%3A%5B%7B%22tag%22%3A%22Big%20data%22%7D%2C%7B%22tag%22%3A%22Data%20science%22%7D%5D%7D%7D%2C%7B%22key%22%3A%22LLY2NCGI%22%2C%22library%22%3A%7B%22id%22%3A2133649%7D%2C%22meta%22%3A%7B%22creatorSummary%22%3A%22Sagiroglu%20and%20Sinanc%22%2C%22parsedDate%22%3A%222013%22%2C%22numChildren%22%3A0%7D%2C%22bib%22%3A%22%3Cdiv%20class%3D%5C%22csl-bib-body%5C%22%20style%3D%5C%22line-height%3A%201.35%3B%20padding-left%3A%201em%3B%20text-indent%3A-1em%3B%5C%22%3E%5Cn%20%20%3Cdiv%20class%3D%5C%22csl-entry%5C%22%3ESagiroglu%2C%20Seref%2C%20and%20Duygu%20Sinanc.%20%26%23x201C%3BBig%20Data%3A%20A%20Review.%26%23x201D%3B%20In%20%3Ci%3E2013%20International%20Conference%20on%20Collaboration%20Technologies%20and%20Systems%20%28CTS%29%3C%5C%2Fi%3E%2C%2042%26%23x2013%3B47%2C%202013.%20%3Ca%20class%3D%27zp-DOIURL%27%20href%3D%27https%3A%5C%2F%5C%2Fdoi.org%5C%2F10.1109%5C%2FCTS.2013.6567202%27%3Ehttps%3A%5C%2F%5C%2Fdoi.org%5C%2F10.1109%5C%2FCTS.2013.6567202%3C%5C%2Fa%3E.%20%3Ca%20title%3D%27Cite%20in%20RIS%20Format%27%20class%3D%27zp-CiteRIS%27%20href%3D%27https%3A%5C%2F%5C%2Fwe1s.ucsb.edu%5C%2Fwp-content%5C%2Fplugins%5C%2Fzotpress%5C%2Flib%5C%2Frequest%5C%2Frequest.cite.php%3Fapi_user_id%3D2133649%26amp%3Bitem_key%3DLLY2NCGI%27%3ECite%3C%5C%2Fa%3E%20%3C%5C%2Fdiv%3E%5Cn%3C%5C%2Fdiv%3E%22%2C%22data%22%3A%7B%22itemType%22%3A%22conferencePaper%22%2C%22title%22%3A%22Big%20data%3A%20A%20review%22%2C%22creators%22%3A%5B%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Seref%22%2C%22lastName%22%3A%22Sagiroglu%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Duygu%22%2C%22lastName%22%3A%22Sinanc%22%7D%5D%2C%22abstractNote%22%3A%22Big%20data%20is%20a%20term%20for%20massive%20data%20sets%20having%20large%2C%20more%20varied%20and%20complex%20structure%20with%20the%20difficulties%20of%20storing%2C%20analyzing%20and%20visualizing%20for%20further%20processes%20or%20results.%20The%20process%20of%20research%20into%20massive%20amounts%20of%20data%20to%20reveal%20hidden%20patterns%20and%20secret%20correlations%20named%20as%20big%20data%20analytics.%20These%20useful%20informations%20for%20companies%20or%20organizations%20with%20the%20help%20of%20gaining%20richer%20and%20deeper%20insights%20and%20getting%20an%20advantage%20over%20the%20competition.%20For%20this%20reason%2C%20big%20data%20implementations%20need%20to%20be%20analyzed%20and%20executed%20as%20accurately%20as%20possible.%20This%20paper%20presents%20an%20overview%20of%20big%20data%27s%20content%2C%20scope%2C%20samples%2C%20methods%2C%20advantages%20and%20challenges%20and%20discusses%20privacy%20concern%20on%20it.%22%2C%22date%22%3A%222013%22%2C%22proceedingsTitle%22%3A%222013%20International%20Conference%20on%20Collaboration%20Technologies%20and%20Systems%20%28CTS%29%22%2C%22conferenceName%22%3A%222013%20International%20Conference%20on%20Collaboration%20Technologies%20and%20Systems%20%28CTS%29%22%2C%22language%22%3A%22en%22%2C%22DOI%22%3A%2210.1109%5C%2FCTS.2013.6567202%22%2C%22ISBN%22%3A%22%22%2C%22url%22%3A%22%22%2C%22collections%22%3A%5B%5D%2C%22dateModified%22%3A%222020-09-03T06%3A41%3A46Z%22%2C%22tags%22%3A%5B%7B%22tag%22%3A%22Big%20data%22%7D%2C%7B%22tag%22%3A%22Data%20mining%22%7D%2C%7B%22tag%22%3A%22Data%20science%22%7D%5D%7D%7D%2C%7B%22key%22%3A%22NQUZ4PDT%22%2C%22library%22%3A%7B%22id%22%3A2133649%7D%2C%22meta%22%3A%7B%22creatorSummary%22%3A%22Labrinidis%20and%20Jagadish%22%2C%22parsedDate%22%3A%222012%22%2C%22numChildren%22%3A0%7D%2C%22bib%22%3A%22%3Cdiv%20class%3D%5C%22csl-bib-body%5C%22%20style%3D%5C%22line-height%3A%201.35%3B%20padding-left%3A%201em%3B%20text-indent%3A-1em%3B%5C%22%3E%5Cn%20%20%3Cdiv%20class%3D%5C%22csl-entry%5C%22%3ELabrinidis%2C%20Alexandros%2C%20and%20H.%20V.%20Jagadish.%20%26%23x201C%3BChallenges%20and%20Opportunities%20with%20Big%20Data.%26%23x201D%3B%20VLDB%20Endowment%2C%202012.%20%3Ca%20class%3D%27zp-ItemURL%27%20href%3D%27https%3A%5C%2F%5C%2Fdoi.org%5C%2F10.14778%5C%2F2367502.2367572%27%3Ehttps%3A%5C%2F%5C%2Fdoi.org%5C%2F10.14778%5C%2F2367502.2367572%3C%5C%2Fa%3E.%20%3Ca%20title%3D%27Cite%20in%20RIS%20Format%27%20class%3D%27zp-CiteRIS%27%20href%3D%27https%3A%5C%2F%5C%2Fwe1s.ucsb.edu%5C%2Fwp-content%5C%2Fplugins%5C%2Fzotpress%5C%2Flib%5C%2Frequest%5C%2Frequest.cite.php%3Fapi_user_id%3D2133649%26amp%3Bitem_key%3DNQUZ4PDT%27%3ECite%3C%5C%2Fa%3E%20%3C%5C%2Fdiv%3E%5Cn%3C%5C%2Fdiv%3E%22%2C%22data%22%3A%7B%22itemType%22%3A%22document%22%2C%22title%22%3A%22Challenges%20and%20opportunities%20with%20big%20data%22%2C%22creators%22%3A%5B%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Alexandros%22%2C%22lastName%22%3A%22Labrinidis%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22H.%20V.%22%2C%22lastName%22%3A%22Jagadish%22%7D%5D%2C%22abstractNote%22%3A%22The%20promise%20of%20data-driven%20decision-making%20is%20now%20being%20recognized%20broadly%2C%20and%20there%20is%20growing%20enthusiasm%20for%20the%20notion%20of%20%5C%22Big%20Data%2C%5C%22%20including%20the%20recent%20announcement%20from%20the%20White%20House%20about%20new%20funding%20initiatives%20across%20different%20agencies%2C%20that%20target%20research%20for%20Big%20Data.%20While%20the%20promise%20of%20Big%20Data%20is%20real%20--%20for%20example%2C%20it%20is%20estimated%20that%20Google%20alone%20contributed%2054%20billion%20dollars%20to%20the%20US%20economy%20in%202009%20--%20there%20is%20no%20clear%20consensus%20on%20what%20is%20Big%20Data.%20In%20fact%2C%20there%20have%20been%20many%20controversial%20statements%20about%20Big%20Data%2C%20such%20as%20%5C%22Size%20is%20the%20only%20thing%20that%20matters.%5C%22%20In%20this%20panel%20we%20will%20try%20to%20explore%20the%20controversies%20and%20debunk%20the%20myths%20surrounding%20Big%20Data.%22%2C%22date%22%3A%222012%22%2C%22language%22%3A%22en%22%2C%22url%22%3A%22https%3A%5C%2F%5C%2Fdoi.org%5C%2F10.14778%5C%2F2367502.2367572%22%2C%22collections%22%3A%5B%5D%2C%22dateModified%22%3A%222020-09-03T06%3A43%3A11Z%22%2C%22tags%22%3A%5B%7B%22tag%22%3A%22Big%20data%22%7D%2C%7B%22tag%22%3A%22Data%20science%22%7D%5D%7D%7D%2C%7B%22key%22%3A%22XMWPUXDX%22%2C%22library%22%3A%7B%22id%22%3A2133649%7D%2C%22meta%22%3A%7B%22creatorSummary%22%3A%22Drout%20and%20Smith%22%2C%22parsedDate%22%3A%222012%22%2C%22numChildren%22%3A0%7D%2C%22bib%22%3A%22%3Cdiv%20class%3D%5C%22csl-bib-body%5C%22%20style%3D%5C%22line-height%3A%201.35%3B%20padding-left%3A%201em%3B%20text-indent%3A-1em%3B%5C%22%3E%5Cn%20%20%3Cdiv%20class%3D%5C%22csl-entry%5C%22%3EDrout%2C%20Michael%2C%20and%20Leah%20Smith.%20%3Ci%3EHow%20to%20Read%20a%20Dendogram%3C%5C%2Fi%3E%2C%202012.%20%3Ca%20class%3D%27zp-ItemURL%27%20href%3D%27https%3A%5C%2F%5C%2Fwheatoncollege.edu%5C%2Fwp-content%5C%2Fuploads%5C%2F2012%5C%2F08%5C%2FHow-to-Read-a-Dendrogram-Web-Ready.pdf%27%3Ehttps%3A%5C%2F%5C%2Fwheatoncollege.edu%5C%2Fwp-content%5C%2Fuploads%5C%2F2012%5C%2F08%5C%2FHow-to-Read-a-Dendrogram-Web-Ready.pdf%3C%5C%2Fa%3E.%20%3Ca%20title%3D%27Cite%20in%20RIS%20Format%27%20class%3D%27zp-CiteRIS%27%20href%3D%27https%3A%5C%2F%5C%2Fwe1s.ucsb.edu%5C%2Fwp-content%5C%2Fplugins%5C%2Fzotpress%5C%2Flib%5C%2Frequest%5C%2Frequest.cite.php%3Fapi_user_id%3D2133649%26amp%3Bitem_key%3DXMWPUXDX%27%3ECite%3C%5C%2Fa%3E%20%3C%5C%2Fdiv%3E%5Cn%3C%5C%2Fdiv%3E%22%2C%22data%22%3A%7B%22itemType%22%3A%22book%22%2C%22title%22%3A%22How%20to%20read%20a%20dendogram%22%2C%22creators%22%3A%5B%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Michael%22%2C%22lastName%22%3A%22Drout%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Leah%22%2C%22lastName%22%3A%22Smith%22%7D%5D%2C%22abstractNote%22%3A%22%5BExplanation%20for%20beginners%20of%20the%20structure%20and%20parts%20of%20a%20hierarchical%20cluster%20dendogram%5D%22%2C%22date%22%3A%222012%22%2C%22language%22%3A%22en%22%2C%22ISBN%22%3A%22%22%2C%22url%22%3A%22https%3A%5C%2F%5C%2Fwheatoncollege.edu%5C%2Fwp-content%5C%2Fuploads%5C%2F2012%5C%2F08%5C%2FHow-to-Read-a-Dendrogram-Web-Ready.pdf%22%2C%22collections%22%3A%5B%5D%2C%22dateModified%22%3A%222019-07-27T21%3A33%3A39Z%22%2C%22tags%22%3A%5B%7B%22tag%22%3A%22Data%20science%22%7D%2C%7B%22tag%22%3A%22Data%20visualization%22%7D%2C%7B%22tag%22%3A%22Hierarchical%20clustering%22%7D%2C%7B%22tag%22%3A%22Topic%20clusters%22%7D%5D%7D%7D%2C%7B%22key%22%3A%22XL5B42VF%22%2C%22library%22%3A%7B%22id%22%3A2133649%7D%2C%22meta%22%3A%7B%22creatorSummary%22%3A%22Boyd%20and%20Crawford%22%2C%22parsedDate%22%3A%222011%22%2C%22numChildren%22%3A0%7D%2C%22bib%22%3A%22%3Cdiv%20class%3D%5C%22csl-bib-body%5C%22%20style%3D%5C%22line-height%3A%201.35%3B%20padding-left%3A%201em%3B%20text-indent%3A-1em%3B%5C%22%3E%5Cn%20%20%3Cdiv%20class%3D%5C%22csl-entry%5C%22%3EBoyd%2C%20Danah%2C%20and%20Kate%20Crawford.%20%26%23x201C%3BSix%20Provocations%20for%20Big%20Data.%26%23x201D%3B%20SSRN%20Scholarly%20Paper.%20Rochester%2C%20NY%3A%20Social%20Science%20Research%20Network%2C%202011.%20%3Ca%20class%3D%27zp-ItemURL%27%20href%3D%27https%3A%5C%2F%5C%2Fpapers.ssrn.com%5C%2Fabstract%3D1926431%27%3Ehttps%3A%5C%2F%5C%2Fpapers.ssrn.com%5C%2Fabstract%3D1926431%3C%5C%2Fa%3E.%20%3Ca%20title%3D%27Cite%20in%20RIS%20Format%27%20class%3D%27zp-CiteRIS%27%20href%3D%27https%3A%5C%2F%5C%2Fwe1s.ucsb.edu%5C%2Fwp-content%5C%2Fplugins%5C%2Fzotpress%5C%2Flib%5C%2Frequest%5C%2Frequest.cite.php%3Fapi_user_id%3D2133649%26amp%3Bitem_key%3DXL5B42VF%27%3ECite%3C%5C%2Fa%3E%20%3C%5C%2Fdiv%3E%5Cn%3C%5C%2Fdiv%3E%22%2C%22data%22%3A%7B%22itemType%22%3A%22report%22%2C%22title%22%3A%22Six%20Provocations%20for%20Big%20Data%22%2C%22creators%22%3A%5B%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Danah%22%2C%22lastName%22%3A%22Boyd%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Kate%22%2C%22lastName%22%3A%22Crawford%22%7D%5D%2C%22abstractNote%22%3A%22The%20era%20of%20Big%20Data%20has%20begun.%20Computer%20scientists%2C%20physicists%2C%20economists%2C%20mathematicians%2C%20political%20scientists%2C%20bio-informaticists%2C%20sociologists%2C%20and%20many%20others%20are%20clamoring%20for%20access%20to%20the%20massive%20quantities%20of%20information%20produced%20by%20and%20about%20people%2C%20things%2C%20and%20their%20interactions.%20Diverse%20groups%20argue%20about%20the%20potential%20benefits%20and%20costs%20of%20analyzing%20information%20from%20Twitter%2C%20Google%2C%20Verizon%2C%2023andMe%2C%20Facebook%2C%20Wikipedia%2C%20and%20every%20space%20where%20large%20groups%20of%20people%20leave%20digital%20traces%20and%20deposit%20data.%20Significant%20questions%20emerge.%20Will%20large-scale%20analysis%20of%20DNA%20help%20cure%20diseases%3F%20Or%20will%20it%20usher%20in%20a%20new%20wave%20of%20medical%20inequality%3F%20Will%20data%20analytics%20help%20make%20people%5Cu2019s%20access%20to%20information%20more%20efficient%20and%20effective%3F%20Or%20will%20it%20be%20used%20to%20track%20protesters%20in%20the%20streets%20of%20major%20cities%3F%20Will%20it%20transform%20how%20we%20study%20human%20communication%20and%20culture%2C%20or%20narrow%20the%20palette%20of%20research%20options%20and%20alter%20what%20%5Cu2018research%5Cu2019%20means%3F%20Some%20or%20all%20of%20the%20above%3F%22%2C%22reportNumber%22%3A%22ID%201926431%22%2C%22reportType%22%3A%22SSRN%20Scholarly%20Paper%22%2C%22institution%22%3A%22Social%20Science%20Research%20Network%22%2C%22date%22%3A%222011%22%2C%22language%22%3A%22en%22%2C%22url%22%3A%22https%3A%5C%2F%5C%2Fpapers.ssrn.com%5C%2Fabstract%3D1926431%22%2C%22collections%22%3A%5B%5D%2C%22dateModified%22%3A%222020-09-03T07%3A09%3A11Z%22%2C%22tags%22%3A%5B%7B%22tag%22%3A%22Big%20data%22%7D%2C%7B%22tag%22%3A%22Data%20science%22%7D%5D%7D%7D%2C%7B%22key%22%3A%22J8CFVKRP%22%2C%22library%22%3A%7B%22id%22%3A2133649%7D%2C%22meta%22%3A%7B%22creatorSummary%22%3A%22Peng%22%2C%22parsedDate%22%3A%222011%22%2C%22numChildren%22%3A0%7D%2C%22bib%22%3A%22%3Cdiv%20class%3D%5C%22csl-bib-body%5C%22%20style%3D%5C%22line-height%3A%201.35%3B%20padding-left%3A%201em%3B%20text-indent%3A-1em%3B%5C%22%3E%5Cn%20%20%3Cdiv%20class%3D%5C%22csl-entry%5C%22%3EPeng%2C%20Roger%20D.%20%26%23x201C%3BReproducible%20Research%20in%20Computational%20Science.%26%23x201D%3B%20%3Ci%3EScience%3C%5C%2Fi%3E%20334%2C%20no.%206060%20%282011%29%3A%201226%26%23x2013%3B27.%20%3Ca%20class%3D%27zp-DOIURL%27%20href%3D%27https%3A%5C%2F%5C%2Fdoi.org%5C%2F10.1126%5C%2Fscience.1213847%27%3Ehttps%3A%5C%2F%5C%2Fdoi.org%5C%2F10.1126%5C%2Fscience.1213847%3C%5C%2Fa%3E.%20%3Ca%20title%3D%27Cite%20in%20RIS%20Format%27%20class%3D%27zp-CiteRIS%27%20href%3D%27https%3A%5C%2F%5C%2Fwe1s.ucsb.edu%5C%2Fwp-content%5C%2Fplugins%5C%2Fzotpress%5C%2Flib%5C%2Frequest%5C%2Frequest.cite.php%3Fapi_user_id%3D2133649%26amp%3Bitem_key%3DJ8CFVKRP%27%3ECite%3C%5C%2Fa%3E%20%3C%5C%2Fdiv%3E%5Cn%3C%5C%2Fdiv%3E%22%2C%22data%22%3A%7B%22itemType%22%3A%22journalArticle%22%2C%22title%22%3A%22Reproducible%20Research%20in%20Computational%20Science%22%2C%22creators%22%3A%5B%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Roger%20D.%22%2C%22lastName%22%3A%22Peng%22%7D%5D%2C%22abstractNote%22%3A%22Computational%20science%20has%20led%20to%20exciting%20new%20developments%2C%20but%20the%20nature%20of%20the%20work%20has%20exposed%20limitations%20in%20our%20ability%20to%20evaluate%20published%20findings.%20Reproducibility%20has%20the%20potential%20to%20serve%20as%20a%20minimum%20standard%20for%20judging%20scientific%20claims%20when%20full%20independent%20replication%20of%20a%20study%20is%20not%20possible.%22%2C%22date%22%3A%222011%22%2C%22language%22%3A%22en%22%2C%22DOI%22%3A%2210.1126%5C%2Fscience.1213847%22%2C%22ISSN%22%3A%220036-8075%2C%201095-9203%22%2C%22url%22%3A%22https%3A%5C%2F%5C%2Fscience.sciencemag.org%5C%2Fcontent%5C%2F334%5C%2F6060%5C%2F1226%22%2C%22collections%22%3A%5B%5D%2C%22dateModified%22%3A%222020-09-03T05%3A26%3A30Z%22%2C%22tags%22%3A%5B%7B%22tag%22%3A%22Data%20science%22%7D%2C%7B%22tag%22%3A%22Reproducibility%22%7D%5D%7D%7D%2C%7B%22key%22%3A%22IC8UJ7SR%22%2C%22library%22%3A%7B%22id%22%3A2133649%7D%2C%22meta%22%3A%7B%22creatorSummary%22%3A%22Sebastiani%22%2C%22parsedDate%22%3A%222002%22%2C%22numChildren%22%3A0%7D%2C%22bib%22%3A%22%3Cdiv%20class%3D%5C%22csl-bib-body%5C%22%20style%3D%5C%22line-height%3A%201.35%3B%20padding-left%3A%201em%3B%20text-indent%3A-1em%3B%5C%22%3E%5Cn%20%20%3Cdiv%20class%3D%5C%22csl-entry%5C%22%3ESebastiani%2C%20Fabrizio.%20%26%23x201C%3BMachine%20Learning%20in%20Automated%20Text%20Categorization.%26%23x201D%3B%20%3Ci%3EACM%20Computing%20Surveys%20%28CSUR%29%3C%5C%2Fi%3E%2034%2C%20no.%201%20%282002%29%3A%201%26%23x2013%3B47.%20%3Ca%20class%3D%27zp-DOIURL%27%20href%3D%27https%3A%5C%2F%5C%2Fdoi.org%5C%2F10.1145%5C%2F505282.505283%27%3Ehttps%3A%5C%2F%5C%2Fdoi.org%5C%2F10.1145%5C%2F505282.505283%3C%5C%2Fa%3E.%20%3Ca%20title%3D%27Cite%20in%20RIS%20Format%27%20class%3D%27zp-CiteRIS%27%20href%3D%27https%3A%5C%2F%5C%2Fwe1s.ucsb.edu%5C%2Fwp-content%5C%2Fplugins%5C%2Fzotpress%5C%2Flib%5C%2Frequest%5C%2Frequest.cite.php%3Fapi_user_id%3D2133649%26amp%3Bitem_key%3DIC8UJ7SR%27%3ECite%3C%5C%2Fa%3E%20%3C%5C%2Fdiv%3E%5Cn%3C%5C%2Fdiv%3E%22%2C%22data%22%3A%7B%22itemType%22%3A%22journalArticle%22%2C%22title%22%3A%22Machine%20learning%20in%20automated%20text%20categorization%22%2C%22creators%22%3A%5B%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Fabrizio%22%2C%22lastName%22%3A%22Sebastiani%22%7D%5D%2C%22abstractNote%22%3A%22The%20automated%20categorization%20%28or%20classification%29%20of%20texts%20into%20predefined%20categories%20has%20witnessed%20a%20booming%20interest%20in%20the%20last%2010%20years%2C%20due%20to%20the%20increased%20availability%20of%20documents%20in%20digital%20form%20and%20the%20ensuing%20need%20to%20organize%20them.%20In%20the%20research%20community%20the%20dominant%20approach%20to%20this%20problem%20is%20based%20on%20machine%20learning%20techniques%3A%20a%20general%20inductive%20process%20automatically%20builds%20a%20classifier%20by%20learning%2C%20from%20a%20set%20of%20preclassified%20documents%2C%20the%20characteristics%20of%20the%20categories.%20The%20advantages%20of%20this%20approach%20over%20the%20knowledge%20engineering%20approach%20%28consisting%20in%20the%20manual%20definition%20of%20a%20classifier%20by%20domain%20experts%29%20are%20a%20very%20good%20effectiveness%2C%20considerable%20savings%20in%20terms%20of%20expert%20labor%20power%2C%20and%20straightforward%20portability%20to%20different%20domains.%20This%20survey%20discusses%20the%20main%20approaches%20to%20text%20categorization%20that%20fall%20within%20the%20machine%20learning%20paradigm.%20We%20will%20discuss%20in%20detail%20issues%20pertaining%20to%20three%20different%20problems%2C%20namely%2C%20document%20representation%2C%20classifier%20construction%2C%20and%20classifier%20evaluation.%22%2C%22date%22%3A%222002%22%2C%22language%22%3A%22en%22%2C%22DOI%22%3A%2210.1145%5C%2F505282.505283%22%2C%22ISSN%22%3A%220360-0300%2C%201557-7341%22%2C%22url%22%3A%22http%3A%5C%2F%5C%2Fdl.acm.org%5C%2Fdoi%5C%2F10.1145%5C%2F505282.505283%22%2C%22collections%22%3A%5B%5D%2C%22dateModified%22%3A%222020-07-25T19%3A49%3A12Z%22%2C%22tags%22%3A%5B%7B%22tag%22%3A%22Data%20science%22%7D%2C%7B%22tag%22%3A%22Machine%20learning%22%7D%2C%7B%22tag%22%3A%22Text%20Analysis%22%7D%2C%7B%22tag%22%3A%22Text%20classification%22%7D%5D%7D%7D%2C%7B%22key%22%3A%22QNXMSWLC%22%2C%22library%22%3A%7B%22id%22%3A2133649%7D%2C%22meta%22%3A%7B%22creatorSummary%22%3A%22Miller%22%2C%22parsedDate%22%3A%221997%22%2C%22numChildren%22%3A0%7D%2C%22bib%22%3A%22%3Cdiv%20class%3D%5C%22csl-bib-body%5C%22%20style%3D%5C%22line-height%3A%201.35%3B%20padding-left%3A%201em%3B%20text-indent%3A-1em%3B%5C%22%3E%5Cn%20%20%3Cdiv%20class%3D%5C%22csl-entry%5C%22%3EMiller%2C%20M.%20Mark.%20%26%23x201C%3BFrame%20Mapping%20and%20Analysis%20of%20News%20Coverage%20of%20Contentious%20Issues.%26%23x201D%3B%20%3Ci%3ESocial%20Science%20Computer%20Review%3C%5C%2Fi%3E%2015%2C%20no.%204%20%281997%29%3A%20367%26%23x2013%3B78.%20%3Ca%20class%3D%27zp-ItemURL%27%20href%3D%27https%3A%5C%2F%5C%2Fdoi.org%5C%2F10.1177%5C%2F089443939701500403%27%3Ehttps%3A%5C%2F%5C%2Fdoi.org%5C%2F10.1177%5C%2F089443939701500403%3C%5C%2Fa%3E.%20%3Ca%20title%3D%27Cite%20in%20RIS%20Format%27%20class%3D%27zp-CiteRIS%27%20href%3D%27https%3A%5C%2F%5C%2Fwe1s.ucsb.edu%5C%2Fwp-content%5C%2Fplugins%5C%2Fzotpress%5C%2Flib%5C%2Frequest%5C%2Frequest.cite.php%3Fapi_user_id%3D2133649%26amp%3Bitem_key%3DQNXMSWLC%27%3ECite%3C%5C%2Fa%3E%20%3C%5C%2Fdiv%3E%5Cn%3C%5C%2Fdiv%3E%22%2C%22data%22%3A%7B%22itemType%22%3A%22journalArticle%22%2C%22title%22%3A%22Frame%20Mapping%20and%20Analysis%20of%20News%20Coverage%20of%20Contentious%20Issues%22%2C%22creators%22%3A%5B%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22M.%20Mark%22%2C%22lastName%22%3A%22Miller%22%7D%5D%2C%22abstractNote%22%3A%22This%20article%20outlines%20a%20rigorous%20method%20for%20studying%20the%20ways%20that%20news%20media%20frame%20contentious%20issues.%20The%20method%20is%20based%20on%20the%20VBPro%20family%20of%20computer%20programs%20for%20content%20analysis.%20Output%20from%20the%20VBPro%20mapping%20program%20for%20multidimensional%20scaling%20based%20on%20co-occurence%20of%20key%20terms%20is%20cluster%20analyzed%20to%20discern%20the%20frames%20or%20points%20of%20view%20in%20texts%20that%20can%20unambiguously%20be%20attributed%20to%20competing%20stakeholders.%20These%20frames%20can%20be%20used%20to%20investigate%20propositions%20about%20news%20stories.%20An%20example%20is%20presented%20from%20a%20study%20of%201%2C465%20Associated%20Press%20articles%20on%20wetlands%20dispatched%20across%20an%2011-year%20period%20beginning%20in%201982.%20The%20results%20demonstrate%20that%20the%20method%20provides%20an%20objective%20means%20of%20investigating%20stakeholder%20influence%20on%20news%20and%20patterns%20of%20change%20in%20frames%20across%20time.%22%2C%22date%22%3A%221997%22%2C%22language%22%3A%22en%22%2C%22DOI%22%3A%2210.1177%5C%2F089443939701500403%22%2C%22ISSN%22%3A%220894-4393%22%2C%22url%22%3A%22https%3A%5C%2F%5C%2Fdoi.org%5C%2F10.1177%5C%2F089443939701500403%22%2C%22collections%22%3A%5B%5D%2C%22dateModified%22%3A%222020-08-15T23%3A09%3A42Z%22%2C%22tags%22%3A%5B%7B%22tag%22%3A%22Data%20science%22%7D%2C%7B%22tag%22%3A%22Frame%20analysis%20of%20media%22%7D%5D%7D%7D%5D%7D
Smith, Gary, and Jay Cordes. The Phantom Pattern Problem: The Mirage of Big Data. First edition. Oxford ; New York, NY: Oxford University Press, 2020. Cite
Koenzen, Andreas, Neil Ernst, and Margaret-Anne Storey. “Code Duplication and Reuse in Jupyter Notebooks.” ArXiv:2005.13709 [Cs], 2020. http://arxiv.org/abs/2005.13709. Cite
Chattopadhyay, Souti, Ishita Prasad, Austin Z. Henley, Anita Sarma, and Titus Barik. “What’s Wrong with Computational Notebooks? Pain Points, Needs, and Design Opportunities.” In Proceedings of the 2020 CHI Conference on Human Factors in Computing Systems, 1–12. CHI ’20. Honolulu, HI, USA: Association for Computing Machinery, 2020. https://doi.org/10.1145/3313831.3376729. Cite
DePratti, Roland. “Jupyter Notebooks versus a Textbook in a Big Data Course.” Journal of Computing Sciences in Colleges 35, no. 8 (2020): 208–20. https://dl.acm.org/doi/abs/10.5555/3417639.3417658. Cite
Willis, Alistair, Patricia Charlton, and Tony Hirst. “Developing Students’ Written Communication Skills with Jupyter Notebooks.” In Proceedings of the 51st ACM Technical Symposium on Computer Science Education, 1089–95. SIGCSE ’20. Portland, OR, USA: Association for Computing Machinery, 2020. https://doi.org/10.1145/3328778.3366927. Cite
Thylstrup, Nanna Bonde, ed. Uncertain Archives: Critical Keywords for Big Data. Cambridge, Massachusetts: The MIT Press, 2020. Cite
Kwak, Haewoon, Jisun An, and Yong-Yeol Ahn. “A Systematic Media Frame Analysis of 1.5 Million New York Times Articles from 2000 to 2017.” ArXiv:2005.01803 [Cs], 2020. http://arxiv.org/abs/2005.01803. Cite
Munro, Robert. Human-in-the-Loop Machine Learning. Shelter Island, New York: Manning, 2020. https://www.manning.com/books/human-in-the-loop-machine-learning. Cite
Wang, April Yi, Anant Mittal, Christopher Brooks, and Steve Oney. “How Data Scientists Use Computational Notebooks for Real-Time Collaboration.” Association for Computing Machinery, 2019. https://doi.org/10.1145/3359141. Cite
Rule, Adam, Amanda Birmingham, Cristal Zuniga, Ilkay Altintas, Shih-Cheng Huang, Rob Knight, Niema Moshiri, et al. “Ten Simple Rules for Writing and Sharing Computational Analyses in Jupyter Notebooks.” PLOS Computational Biology 15, no. 7 (2019): e1007007. https://doi.org/10.1371/journal.pcbi.1007007. Cite
Pandey, Parul. Interpretable Machine Learning, 2019. https://towardsdatascience.com/interpretable-machine-learning-1dec0f2f3e6b. Cite
Wikipedia. Confusion Matrix, 2019. https://en.wikipedia.org/w/index.php?title=Confusion_matrix&oldid=881721342. Cite
“Big Data Technologies: A Survey.” Journal of King Saud University - Computer and Information Sciences 30, no. 4 (2018): 431–48. https://doi.org/10.1016/j.jksuci.2017.06.001. Cite
Kery, Mary Beth, Marissa Radensky, Mahima Arya, Bonnie E. John, and Brad A. Myers. “The Story in the Notebook: Exploratory Data Science Using a Literate Programming Tool.” In Proceedings of the 2018 CHI Conference on Human Factors in Computing Systems, 1–11. CHI ’18. Montreal QC, Canada: Association for Computing Machinery, 2018. https://doi.org/10.1145/3173574.3173748. Cite
Narkhede, Sarang. Understanding Confusion Matrix, 2018. https://towardsdatascience.com/understanding-confusion-matrix-a9ad42dcfd62. Cite
Kleinman, Scott, Mark D. LeBlanc, and Michael Drout. Hierarchical Clustering, 2018. http://scalar.usc.edu/works/lexos/hierarchical-clustering?path=manual. Cite
Randles, Bernadette M., Irene V. Pasquetto, Milena S. Golshan, and Christine L. Borgman. “Using the Jupyter Notebook as a Tool for Open Science: An Empirical Study.” In 2017 ACM/IEEE Joint Conference on Digital Libraries (JCDL), 1–2, 2017. https://doi.org/10.1109/JCDL.2017.7991618. Cite
Jupyter, Project. “Project Jupyter: Computational Narratives as the Engine of Collaborative Data Science.” Medium, 2017. https://blog.jupyter.org/project-jupyter-computational-narratives-as-the-engine-of-collaborative-data-science-2b5fb94c3c58. Cite
Ruchansky, Natali, Sungyong Seo, and Yan Liu. “CSI: A Hybrid Deep Model for Fake News Detection.” In Proceedings of the 2017 ACM on Conference on Information and Knowledge Management, 797–806. CIKM ’17. Singapore, Singapore: Association for Computing Machinery, 2017. https://doi.org/10.1145/3132847.3132877. Cite
Wang, William Yang. “‘Liar, Liar Pants on Fire’: A New Benchmark Dataset for Fake News Detection.” ArXiv:1705.00648 [Cs], 2017. http://arxiv.org/abs/1705.00648. Cite
Mützel, Sophie. “Facing Big Data: Making Sociology Relevant , Facing Big Data: Making Sociology Relevant.” Big Data & Society 2, no. 2 (2015): 2053951715599179. https://doi.org/10.1177/2053951715599179. Cite
Hashem, Ibrahim Abaker Targio, Ibrar Yaqoob, Nor Badrul Anuar, Salimah Mokhtar, Abdullah Gani, and Samee Ullah Khan. “The Rise of ‘Big Data’ on Cloud Computing: Review and Open Research Issues.” Information Systems 47 (2015): 98–115. https://doi.org/10.1016/j.is.2014.07.006. Cite
“Beyond the Hype: Big Data Concepts, Methods, and Analytics.” International Journal of Information Management 35, no. 2 (2015): 137–44. https://doi.org/10.1016/j.ijinfomgt.2014.10.007. Cite
Ahonen, Pertti. “Institutionalizing Big Data Methods in Social and Political Research , Institutionalizing Big Data Methods in Social and Political Research.” Big Data & Society 2, no. 2 (2015): 2053951715591224. https://doi.org/10.1177/2053951715591224. Cite
Conroy, Niall J., Victoria L. Rubin, and Yimin Chen. “Automatic Deception Detection: Methods for Finding Fake News: Automatic Deception Detection: Methods for Finding Fake News.” Proceedings of the Association for Information Science and Technology 52, no. 1 (2015): 1–4. https://doi.org/10.1002/pra2.2015.145052010082. Cite
Lazer, David, Ryan Kennedy, Gary King, and Alessandro Vespignani. “The Parable of Google Flu: Traps in Big Data Analysis.” Science 343, no. 6176 (2014): 1203–5. https://doi.org/10.1126/science.1248506. Cite
Kitchin, Rob. “Big Data, New Epistemologies and Paradigm Shifts.” Big Data & Society 1, no. 1 (2014): 2053951714528481. https://doi.org/10.1177/2053951714528481. Cite
Kitchin, Rob. The Data Revolution: Big Data, Open Data, Data Infrastructures & Their Consequences. Los Angeles, California: SAGE Publications, 2014. Cite
Philip Chen, C. L., and Chun-Yang Zhang. “Data-Intensive Applications, Challenges, Techniques and Technologies: A Survey on Big Data.” Information Sciences 275 (2014): 314–47. https://doi.org/10.1016/j.ins.2014.01.015. Cite
Bail, Christopher. The Cultural Environment: Measuring Culture With Big Data, 2014. https://www.researchgate.net/publication/260705893_The_Cultural_Environment_Measuring_Culture_With_Big_Data. Cite
Chen, Min, Shiwen Mao, and Yunhao Liu. “Big Data: A Survey.” Mobile Networks and Applications 19, no. 2 (2014): 171–209. https://doi.org/10.1007/s11036-013-0489-0. Cite
Wu, Xindong, Xingquan Zhu, Gong-Qing Wu, and Wei Ding. “Data Mining with Big Data.” IEEE Transactions on Knowledge and Data Engineering 26, no. 1 (2014): 97–107. https://doi.org/10.1109/TKDE.2013.109. Cite
Burscher, Björn, Daan Odijk, Rens Vliegenthart, Maarten de Rijke, and Claes H. de Vreese. “Teaching the Computer to Code Frames in News: Comparing Two Supervised Machine Learning Approaches to Frame Analysis.” Communication Methods and Measures 8, no. 3 (2014): 190–206. https://doi.org/10.1080/19312458.2014.937527. Cite
Richards, Neil M., and Jonathan H. King. “Three Paradoxes of Big Data.” Stanford Law Review 66 (2013). https://www.stanfordlawreview.org/online/privacy-and-big-data-three-paradoxes-of-big-data/. Cite
Ward, Jonathan Stuart, and Adam Barker. “Undefined By Data: A Survey of Big Data Definitions.” ArXiv:1309.5821 [Cs], 2013. http://arxiv.org/abs/1309.5821. Cite
Sagiroglu, Seref, and Duygu Sinanc. “Big Data: A Review.” In 2013 International Conference on Collaboration Technologies and Systems (CTS), 42–47, 2013. https://doi.org/10.1109/CTS.2013.6567202. Cite
Labrinidis, Alexandros, and H. V. Jagadish. “Challenges and Opportunities with Big Data.” VLDB Endowment, 2012. https://doi.org/10.14778/2367502.2367572. Cite
Drout, Michael, and Leah Smith. How to Read a Dendogram, 2012. https://wheatoncollege.edu/wp-content/uploads/2012/08/How-to-Read-a-Dendrogram-Web-Ready.pdf. Cite
Boyd, Danah, and Kate Crawford. “Six Provocations for Big Data.” SSRN Scholarly Paper. Rochester, NY: Social Science Research Network, 2011. https://papers.ssrn.com/abstract=1926431. Cite
Peng, Roger D. “Reproducible Research in Computational Science.” Science 334, no. 6060 (2011): 1226–27. https://doi.org/10.1126/science.1213847. Cite
Sebastiani, Fabrizio. “Machine Learning in Automated Text Categorization.” ACM Computing Surveys (CSUR) 34, no. 1 (2002): 1–47. https://doi.org/10.1145/505282.505283. Cite
Miller, M. Mark. “Frame Mapping and Analysis of News Coverage of Contentious Issues.” Social Science Computer Review 15, no. 4 (1997): 367–78. https://doi.org/10.1177/089443939701500403. Cite