Bibliography – Topic Model Optimization

Sorted by: Author | Title | Date (with abstracts) | Recently Added

(all) Corpus Representativeness Comparison paradigms for idea of a corpus: Archives as Paradigm | Canons as Paradigm | Editions as Paradigm | Corpus Linguistics as Paradigm

Topic Modeling (all)

Selected DH research and resources bearing on, or utilized by, the WE1S project.

(all) Grounded Theory | Human Subjects Research

(all) | Publications | Talks | Research Blog Posts (selected)

Searchable version of bibliography on Zotero site For WE1S developers: Biblio style guide | Biblio collection form (suggest additions) | WE1S Bibliography Ontology Outline

2133649 Topic Model Optimization 1 chicago-fullnote-bibliography 50 date desc year 1 1 1 2401 https://we1s.ucsb.edu/wp-content/plugins/zotpress/

%7B%22status%22%3A%22success%22%2C%22updateneeded%22%3Afalse%2C%22instance%22%3Afalse%2C%22meta%22%3A%7B%22request_last%22%3A0%2C%22request_next%22%3A0%2C%22used_cache%22%3Atrue%7D%2C%22data%22%3A%5B%7B%22key%22%3A%22YBHGFYCA%22%2C%22library%22%3A%7B%22id%22%3A2133649%7D%2C%22meta%22%3A%7B%22creatorSummary%22%3A%22Kapadia%22%2C%22parsedDate%22%3A%222020%22%2C%22numChildren%22%3A0%7D%2C%22bib%22%3A%22%26lt%3Bdiv%20class%3D%26quot%3Bcsl-bib-body%26quot%3B%20style%3D%26quot%3Bline-height%3A%201.35%3B%20padding-left%3A%201em%3B%20text-indent%3A-1em%3B%26quot%3B%26gt%3B%5Cn%20%20%26lt%3Bdiv%20class%3D%26quot%3Bcsl-entry%26quot%3B%26gt%3BKapadia%2C%20Shashank.%20%26%23x201C%3BEvaluate%20Topic%20Models%3A%20Latent%20Dirichlet%20Allocation%20%28LDA%29.%26%23x201D%3B%20Medium%2C%202020.%20%26lt%3Ba%20class%3D%26%23039%3Bzp-ItemURL%26%23039%3B%20href%3D%26%23039%3Bhttps%3A%5C%2F%5C%2Ftowardsdatascience.com%5C%2Fevaluate-topic-model-in-python-latent-dirichlet-allocation-lda-7d57484bb5d0%26%23039%3B%26gt%3Bhttps%3A%5C%2F%5C%2Ftowardsdatascience.com%5C%2Fevaluate-topic-model-in-python-latent-dirichlet-allocation-lda-7d57484bb5d0%26lt%3B%5C%2Fa%26gt%3B.%20%26lt%3Ba%20title%3D%26%23039%3BCite%20in%20RIS%20Format%26%23039%3B%20class%3D%26%23039%3Bzp-CiteRIS%26%23039%3B%20data-zp-cite%3D%26%23039%3Bapi_user_id%3D2133649%26amp%3Bitem_key%3DYBHGFYCA%26%23039%3B%20href%3D%26%23039%3Bjavascript%3Avoid%280%29%3B%26%23039%3B%26gt%3BCite%26lt%3B%5C%2Fa%26gt%3B%20%26lt%3B%5C%2Fdiv%26gt%3B%5Cn%26lt%3B%5C%2Fdiv%26gt%3B%22%2C%22data%22%3A%7B%22itemType%22%3A%22webpage%22%2C%22title%22%3A%22Evaluate%20Topic%20Models%3A%20Latent%20Dirichlet%20Allocation%20%28LDA%29%22%2C%22creators%22%3A%5B%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Shashank%22%2C%22lastName%22%3A%22Kapadia%22%7D%5D%2C%22abstractNote%22%3A%22A%20step-by-step%20guide%20to%20building%20interpretable%20topic%20models.%22%2C%22date%22%3A%222020%22%2C%22url%22%3A%22https%3A%5C%2F%5C%2Ftowardsdatascience.com%5C%2Fevaluate-topic-model-in-python-latent-dirichlet-allocation-lda-7d57484bb5d0%22%2C%22language%22%3A%22en%22%2C%22collections%22%3A%5B%5D%2C%22dateModified%22%3A%222021-02-15T08%3A36%3A04Z%22%2C%22tags%22%3A%5B%7B%22tag%22%3A%22Topic%20model%20interpretation%22%7D%2C%7B%22tag%22%3A%22Topic%20model%20optimization%22%7D%2C%7B%22tag%22%3A%22Topic%20modeling%22%7D%5D%7D%7D%2C%7B%22key%22%3A%22SWIMDFE4%22%2C%22library%22%3A%7B%22id%22%3A2133649%7D%2C%22meta%22%3A%7B%22creatorSummary%22%3A%22Wikipedia%22%2C%22parsedDate%22%3A%222019%22%2C%22numChildren%22%3A1%7D%2C%22bib%22%3A%22%26lt%3Bdiv%20class%3D%26quot%3Bcsl-bib-body%26quot%3B%20style%3D%26quot%3Bline-height%3A%201.35%3B%20padding-left%3A%201em%3B%20text-indent%3A-1em%3B%26quot%3B%26gt%3B%5Cn%20%20%26lt%3Bdiv%20class%3D%26quot%3Bcsl-entry%26quot%3B%26gt%3BWikipedia.%20%26lt%3Bi%26gt%3BConfusion%20Matrix%26lt%3B%5C%2Fi%26gt%3B%2C%202019.%20%26lt%3Ba%20class%3D%26%23039%3Bzp-ItemURL%26%23039%3B%20href%3D%26%23039%3Bhttps%3A%5C%2F%5C%2Fen.wikipedia.org%5C%2Fw%5C%2Findex.php%3Ftitle%3DConfusion_matrix%26amp%3Boldid%3D881721342%26%23039%3B%26gt%3Bhttps%3A%5C%2F%5C%2Fen.wikipedia.org%5C%2Fw%5C%2Findex.php%3Ftitle%3DConfusion_matrix%26amp%3Boldid%3D881721342%26lt%3B%5C%2Fa%26gt%3B.%20%26lt%3Ba%20title%3D%26%23039%3BCite%20in%20RIS%20Format%26%23039%3B%20class%3D%26%23039%3Bzp-CiteRIS%26%23039%3B%20data-zp-cite%3D%26%23039%3Bapi_user_id%3D2133649%26amp%3Bitem_key%3DSWIMDFE4%26%23039%3B%20href%3D%26%23039%3Bjavascript%3Avoid%280%29%3B%26%23039%3B%26gt%3BCite%26lt%3B%5C%2Fa%26gt%3B%20%26lt%3B%5C%2Fdiv%26gt%3B%5Cn%26lt%3B%5C%2Fdiv%26gt%3B%22%2C%22data%22%3A%7B%22itemType%22%3A%22book%22%2C%22title%22%3A%22Confusion%20matrix%22%2C%22creators%22%3A%5B%7B%22creatorType%22%3A%22author%22%2C%22name%22%3A%22Wikipedia%22%7D%5D%2C%22abstractNote%22%3A%22In%20the%20field%20of%20machine%20learning%20and%20specifically%20the%20problem%20of%20statistical%20classification%2C%20a%20confusion%20matrix%2C%20also%20known%20as%20an%20error%20matrix%2C%20is%20a%20specific%20table%20layout%20that%20allows%20visualization%20of%20the%20performance%20of%20an%20algorithm%2C%20typically%20a%20supervised%20learning%20one%20%28in%20unsupervised%20learning%20it%20is%20usually%20called%20a%20matching%20matrix%29.%20Each%20row%20of%20the%20matrix%20represents%20the%20instances%20in%20a%20predicted%20class%20while%20each%20column%20represents%20the%20instances%20in%20an%20actual%20class%20%28or%20vice%20versa%29.%20The%20name%20stems%20from%20the%20fact%20that%20it%20makes%20it%20easy%20to%20see%20if%20the%20system%20is%20confusing%20two%20classes%20%28i.e.%20commonly%20mislabeling%20one%20as%20another%29.%22%2C%22date%22%3A%222019%22%2C%22language%22%3A%22en%22%2C%22ISBN%22%3A%22%22%2C%22url%22%3A%22https%3A%5C%2F%5C%2Fen.wikipedia.org%5C%2Fw%5C%2Findex.php%3Ftitle%3DConfusion_matrix%26oldid%3D881721342%22%2C%22collections%22%3A%5B%5D%2C%22dateModified%22%3A%222019-07-27T21%3A33%3A48Z%22%2C%22tags%22%3A%5B%7B%22tag%22%3A%22Data%20science%22%7D%2C%7B%22tag%22%3A%22Statistics%22%7D%2C%7B%22tag%22%3A%22Topic%20model%20optimization%22%7D%5D%7D%7D%2C%7B%22key%22%3A%22R8CIPUFG%22%2C%22library%22%3A%7B%22id%22%3A2133649%7D%2C%22meta%22%3A%7B%22creatorSummary%22%3A%22Syed%20and%20Spruit%22%2C%22parsedDate%22%3A%222018%22%2C%22numChildren%22%3A0%7D%2C%22bib%22%3A%22%26lt%3Bdiv%20class%3D%26quot%3Bcsl-bib-body%26quot%3B%20style%3D%26quot%3Bline-height%3A%201.35%3B%20padding-left%3A%201em%3B%20text-indent%3A-1em%3B%26quot%3B%26gt%3B%5Cn%20%20%26lt%3Bdiv%20class%3D%26quot%3Bcsl-entry%26quot%3B%26gt%3BSyed%2C%20S.%2C%20and%20M.%20Spruit.%20%26%23x201C%3BSelecting%20Priors%20for%20Latent%20Dirichlet%20Allocation.%26%23x201D%3B%20In%20%26lt%3Bi%26gt%3B2018%20IEEE%2012th%20International%20Conference%20on%20Semantic%20Computing%20%28ICSC%29%26lt%3B%5C%2Fi%26gt%3B%2C%20194%26%23x2013%3B202%2C%202018.%20%26lt%3Ba%20class%3D%26%23039%3Bzp-DOIURL%26%23039%3B%20href%3D%26%23039%3Bhttps%3A%5C%2F%5C%2Fdoi.org%5C%2F10.1109%5C%2FICSC.2018.00035%26%23039%3B%26gt%3Bhttps%3A%5C%2F%5C%2Fdoi.org%5C%2F10.1109%5C%2FICSC.2018.00035%26lt%3B%5C%2Fa%26gt%3B.%20%26lt%3Ba%20title%3D%26%23039%3BCite%20in%20RIS%20Format%26%23039%3B%20class%3D%26%23039%3Bzp-CiteRIS%26%23039%3B%20data-zp-cite%3D%26%23039%3Bapi_user_id%3D2133649%26amp%3Bitem_key%3DR8CIPUFG%26%23039%3B%20href%3D%26%23039%3Bjavascript%3Avoid%280%29%3B%26%23039%3B%26gt%3BCite%26lt%3B%5C%2Fa%26gt%3B%20%26lt%3B%5C%2Fdiv%26gt%3B%5Cn%26lt%3B%5C%2Fdiv%26gt%3B%22%2C%22data%22%3A%7B%22itemType%22%3A%22conferencePaper%22%2C%22title%22%3A%22Selecting%20Priors%20for%20Latent%20Dirichlet%20Allocation%22%2C%22creators%22%3A%5B%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22S.%22%2C%22lastName%22%3A%22Syed%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22M.%22%2C%22lastName%22%3A%22Spruit%22%7D%5D%2C%22abstractNote%22%3A%22Latent%20Dirichlet%20Allocation%20%28LDA%29%20has%20gained%20much%20attention%20from%20researchers%20and%20is%20increasingly%20being%20applied%20to%20uncover%20underlying%20semantic%20structures%20from%20a%20variety%20of%20corpora.%20However%2C%20nearly%20all%20researchers%20use%20symmetrical%20Dirichlet%20priors%2C%20often%20unaware%20of%20the%20underlying%20practical%20implications%20that%20they%20bear.%20This%20research%20is%20the%20first%20to%20explore%20symmetrical%20and%20asymmetrical%20Dirichlet%20priors%20on%20topic%20coherence%20and%20human%20topic%20ranking%20when%20uncovering%20latent%20semantic%20structures%20from%20scientific%20research%20articles.%20More%20specifically%2C%20we%20examine%20the%20practical%20effects%20of%20several%20classes%20of%20Dirichlet%20priors%20on%202000%20LDA%20models%20created%20from%20abstract%20and%20full-text%20research%20articles.%20Our%20results%20show%20that%20symmetrical%20or%20asymmetrical%20priors%20on%20the%20document-topic%20distribution%20or%20the%20topic-word%20distribution%20for%20full-text%20data%20have%20little%20effect%20on%20topic%20coherence%20scores%20and%20human%20topic%20ranking.%20In%20contrast%2C%20asymmetrical%20priors%20on%20the%20document-topic%20distribution%20for%20abstract%20data%20show%20a%20significant%20increase%20in%20topic%20coherence%20scores%20and%20improved%20human%20topic%20ranking%20compared%20to%20a%20symmetrical%20prior.%20Symmetrical%20or%20asymmetrical%20priors%20on%20the%20topic-word%20distribution%20show%20no%20real%20benefits%20for%20both%20abstract%20and%20full-text%20data.%22%2C%22date%22%3A%222018%22%2C%22proceedingsTitle%22%3A%222018%20IEEE%2012th%20International%20Conference%20on%20Semantic%20Computing%20%28ICSC%29%22%2C%22conferenceName%22%3A%222018%20IEEE%2012th%20International%20Conference%20on%20Semantic%20Computing%20%28ICSC%29%22%2C%22language%22%3A%22en%22%2C%22DOI%22%3A%2210.1109%5C%2FICSC.2018.00035%22%2C%22ISBN%22%3A%22%22%2C%22url%22%3A%22%22%2C%22collections%22%3A%5B%5D%2C%22dateModified%22%3A%222021-02-15T08%3A41%3A33Z%22%2C%22tags%22%3A%5B%7B%22tag%22%3A%22Topic%20model%20interpretation%22%7D%2C%7B%22tag%22%3A%22Topic%20model%20optimization%22%7D%2C%7B%22tag%22%3A%22Topic%20modeling%22%7D%5D%7D%7D%2C%7B%22key%22%3A%222US5AIKC%22%2C%22library%22%3A%7B%22id%22%3A2133649%7D%2C%22meta%22%3A%7B%22creatorSummary%22%3A%22George%20and%20Doss%22%2C%22parsedDate%22%3A%222018%22%2C%22numChildren%22%3A0%7D%2C%22bib%22%3A%22%26lt%3Bdiv%20class%3D%26quot%3Bcsl-bib-body%26quot%3B%20style%3D%26quot%3Bline-height%3A%201.35%3B%20padding-left%3A%201em%3B%20text-indent%3A-1em%3B%26quot%3B%26gt%3B%5Cn%20%20%26lt%3Bdiv%20class%3D%26quot%3Bcsl-entry%26quot%3B%26gt%3BGeorge%2C%20Clint%20P.%2C%20and%20Hani%20Doss.%20%26%23x201C%3BPrincipled%20Selection%20of%20Hyperparameters%20in%20the%20Latent%20Dirichlet%20Allocation%20Model.%26%23x201D%3B%20%26lt%3Bi%26gt%3BJournal%20of%20Machine%20Learning%20Research%26lt%3B%5C%2Fi%26gt%3B%2018%2C%20no.%20162%20%282018%29%3A%201%26%23x2013%3B38.%20%26lt%3Ba%20class%3D%26%23039%3Bzp-ItemURL%26%23039%3B%20href%3D%26%23039%3Bhttp%3A%5C%2F%5C%2Fjmlr.org%5C%2Fpapers%5C%2Fv18%5C%2F15-595.html%26%23039%3B%26gt%3Bhttp%3A%5C%2F%5C%2Fjmlr.org%5C%2Fpapers%5C%2Fv18%5C%2F15-595.html%26lt%3B%5C%2Fa%26gt%3B.%20%26lt%3Ba%20title%3D%26%23039%3BCite%20in%20RIS%20Format%26%23039%3B%20class%3D%26%23039%3Bzp-CiteRIS%26%23039%3B%20data-zp-cite%3D%26%23039%3Bapi_user_id%3D2133649%26amp%3Bitem_key%3D2US5AIKC%26%23039%3B%20href%3D%26%23039%3Bjavascript%3Avoid%280%29%3B%26%23039%3B%26gt%3BCite%26lt%3B%5C%2Fa%26gt%3B%20%26lt%3B%5C%2Fdiv%26gt%3B%5Cn%26lt%3B%5C%2Fdiv%26gt%3B%22%2C%22data%22%3A%7B%22itemType%22%3A%22journalArticle%22%2C%22title%22%3A%22Principled%20Selection%20of%20Hyperparameters%20in%20the%20Latent%20Dirichlet%20Allocation%20Model%22%2C%22creators%22%3A%5B%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Clint%20P.%22%2C%22lastName%22%3A%22George%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Hani%22%2C%22lastName%22%3A%22Doss%22%7D%5D%2C%22abstractNote%22%3A%22Latent%20Dirichlet%20Allocation%20%28LDA%29%20is%20a%20well%20known%20topic%20model%20that%20is%20often%20used%20to%20make%20inference%20regarding%20the%20properties%20of%20collections%20of%20text%20documents.%20LDA%20is%20a%20hierarchical%20Bayesian%20model%2C%20and%20involves%20a%20prior%20distribution%20on%20a%20set%20of%20latent%20topic%20variables.%20The%20prior%20is%20indexed%20by%20certain%20hyperparameters%2C%20and%20even%20though%20these%20have%20a%20large%20impact%20on%20inference%2C%20they%20are%20usually%20chosen%20either%20in%20an%20ad-hoc%20manner%2C%20or%20by%20applying%20an%20algorithm%20whose%20theoretical%20basis%20has%20not%20been%20firmly%20established.%20We%20present%20a%20method%2C%20based%20on%20a%20combination%20of%20Markov%20chain%20Monte%20Carlo%20and%20importance%20sampling%2C%20for%20estimating%20the%20maximum%20likelihood%20estimate%20of%20the%20hyperparameters.%20The%20method%20may%20be%20viewed%20as%20a%20computational%20scheme%20for%20implementation%20of%20an%20empirical%20Bayes%20analysis.%20It%20comes%20with%20theoretical%20guarantees%2C%20and%20a%20key%20feature%20of%20our%20approach%20is%20that%20we%20provide%20theoretically-valid%20error%20margins%20for%20our%20estimates.%20Experiments%20on%20both%20synthetic%20and%20real%20data%20show%20good%20performance%20of%20our%20methodology.%22%2C%22date%22%3A%222018%22%2C%22language%22%3A%22en%22%2C%22DOI%22%3A%22%22%2C%22ISSN%22%3A%221533-7928%22%2C%22url%22%3A%22http%3A%5C%2F%5C%2Fjmlr.org%5C%2Fpapers%5C%2Fv18%5C%2F15-595.html%22%2C%22collections%22%3A%5B%5D%2C%22dateModified%22%3A%222021-02-15T08%3A31%3A36Z%22%2C%22tags%22%3A%5B%7B%22tag%22%3A%22Topic%20model%20optimization%22%7D%2C%7B%22tag%22%3A%22Topic%20modeling%22%7D%5D%7D%7D%2C%7B%22key%22%3A%228KT424LE%22%2C%22library%22%3A%7B%22id%22%3A2133649%7D%2C%22meta%22%3A%7B%22creatorSummary%22%3A%22Narkhede%22%2C%22parsedDate%22%3A%222018%22%2C%22numChildren%22%3A0%7D%2C%22bib%22%3A%22%26lt%3Bdiv%20class%3D%26quot%3Bcsl-bib-body%26quot%3B%20style%3D%26quot%3Bline-height%3A%201.35%3B%20padding-left%3A%201em%3B%20text-indent%3A-1em%3B%26quot%3B%26gt%3B%5Cn%20%20%26lt%3Bdiv%20class%3D%26quot%3Bcsl-entry%26quot%3B%26gt%3BNarkhede%2C%20Sarang.%20%26lt%3Bi%26gt%3BUnderstanding%20Confusion%20Matrix%26lt%3B%5C%2Fi%26gt%3B%2C%202018.%20%26lt%3Ba%20class%3D%26%23039%3Bzp-ItemURL%26%23039%3B%20href%3D%26%23039%3Bhttps%3A%5C%2F%5C%2Ftowardsdatascience.com%5C%2Funderstanding-confusion-matrix-a9ad42dcfd62%26%23039%3B%26gt%3Bhttps%3A%5C%2F%5C%2Ftowardsdatascience.com%5C%2Funderstanding-confusion-matrix-a9ad42dcfd62%26lt%3B%5C%2Fa%26gt%3B.%20%26lt%3Ba%20title%3D%26%23039%3BCite%20in%20RIS%20Format%26%23039%3B%20class%3D%26%23039%3Bzp-CiteRIS%26%23039%3B%20data-zp-cite%3D%26%23039%3Bapi_user_id%3D2133649%26amp%3Bitem_key%3D8KT424LE%26%23039%3B%20href%3D%26%23039%3Bjavascript%3Avoid%280%29%3B%26%23039%3B%26gt%3BCite%26lt%3B%5C%2Fa%26gt%3B%20%26lt%3B%5C%2Fdiv%26gt%3B%5Cn%26lt%3B%5C%2Fdiv%26gt%3B%22%2C%22data%22%3A%7B%22itemType%22%3A%22book%22%2C%22title%22%3A%22Understanding%20Confusion%20Matrix%22%2C%22creators%22%3A%5B%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Sarang%22%2C%22lastName%22%3A%22Narkhede%22%7D%5D%2C%22abstractNote%22%3A%22When%20data%20is%20gathered%2C%20after%20data%20cleaning%2C%20pre-processing%20and%20wrangling%2C%20the%20first%20step%20is%20to%20feed%20it%20to%20an%20outstanding%20model%20and%20of%20course%2C%20get%20output%20in%20probabilities.%20But%20hold%20on%21%20How%20in%20the%20hell%20can%20one%20measure%20the%20effectiveness%20of%20their%20model%3F%20The%20better%20the%20effectiveness%2C%20the%20better%20the%20performance%2C%20and%20that%5Cu2019s%20exactly%20what%20we%20want.%20And%20it%20is%20where%20the%20Confusion%20matrix%20comes%20into%20the%20limelight.%20Confusion%20Matrix%20is%20a%20performance%20measurement%20for%20machine%20learning%20classification.%20This%20blog%20aims%20to%20answer%20following%20questions%3A%20What%20the%20confusion%20matrix%20is%20and%20why%20you%20need%20it%3F%20How%20to%20calculate%20Confusion%20Matrix%20for%20a%202-class%20classification%20problem%3F%22%2C%22date%22%3A%222018%22%2C%22language%22%3A%22en%22%2C%22ISBN%22%3A%22%22%2C%22url%22%3A%22https%3A%5C%2F%5C%2Ftowardsdatascience.com%5C%2Funderstanding-confusion-matrix-a9ad42dcfd62%22%2C%22collections%22%3A%5B%5D%2C%22dateModified%22%3A%222019-07-27T21%3A40%3A47Z%22%2C%22tags%22%3A%5B%7B%22tag%22%3A%22Data%20science%22%7D%2C%7B%22tag%22%3A%22Statistics%22%7D%2C%7B%22tag%22%3A%22Topic%20model%20optimization%22%7D%5D%7D%7D%2C%7B%22key%22%3A%22W5MQRCY3%22%2C%22library%22%3A%7B%22id%22%3A2133649%7D%2C%22meta%22%3A%7B%22creatorSummary%22%3A%22Dewi%20and%20Thiel%22%2C%22parsedDate%22%3A%222017%22%2C%22numChildren%22%3A0%7D%2C%22bib%22%3A%22%26lt%3Bdiv%20class%3D%26quot%3Bcsl-bib-body%26quot%3B%20style%3D%26quot%3Bline-height%3A%201.35%3B%20padding-left%3A%201em%3B%20text-indent%3A-1em%3B%26quot%3B%26gt%3B%5Cn%20%20%26lt%3Bdiv%20class%3D%26quot%3Bcsl-entry%26quot%3B%26gt%3BDewi%2C%20Andisa%2C%20and%20Kilian%20Thiel.%20%26%23x201C%3BTopic%20Extraction%3A%20Optimizing%20the%20Number%20of%20Topics%20with%20the%20Elbow%20Method.%26%23x201D%3B%20%26lt%3Bi%26gt%3BKNIME%26lt%3B%5C%2Fi%26gt%3B%20%28blog%29%2C%202017.%20%26lt%3Ba%20class%3D%26%23039%3Bzp-ItemURL%26%23039%3B%20href%3D%26%23039%3Bhttps%3A%5C%2F%5C%2Fwww.knime.com%5C%2Fblog%5C%2Ftopic-extraction-optimizing-the-number-of-topics-with-the-elbow-method%26%23039%3B%26gt%3Bhttps%3A%5C%2F%5C%2Fwww.knime.com%5C%2Fblog%5C%2Ftopic-extraction-optimizing-the-number-of-topics-with-the-elbow-method%26lt%3B%5C%2Fa%26gt%3B.%20%26lt%3Ba%20title%3D%26%23039%3BCite%20in%20RIS%20Format%26%23039%3B%20class%3D%26%23039%3Bzp-CiteRIS%26%23039%3B%20data-zp-cite%3D%26%23039%3Bapi_user_id%3D2133649%26amp%3Bitem_key%3DW5MQRCY3%26%23039%3B%20href%3D%26%23039%3Bjavascript%3Avoid%280%29%3B%26%23039%3B%26gt%3BCite%26lt%3B%5C%2Fa%26gt%3B%20%26lt%3B%5C%2Fdiv%26gt%3B%5Cn%26lt%3B%5C%2Fdiv%26gt%3B%22%2C%22data%22%3A%7B%22itemType%22%3A%22blogPost%22%2C%22title%22%3A%22Topic%20Extraction%3A%20Optimizing%20the%20Number%20of%20Topics%20with%20the%20Elbow%20Method%22%2C%22creators%22%3A%5B%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Andisa%22%2C%22lastName%22%3A%22Dewi%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Kilian%22%2C%22lastName%22%3A%22Thiel%22%7D%5D%2C%22abstractNote%22%3A%22%5BSecond%20paragraph%3A%5D%20In%20this%20blog%20post%20we%20will%20show%20a%20step-by-step%20example%20of%20how%20to%20determine%20the%20optimal%20number%20of%20topics%20using%20clustering%20and%20how%20to%20extract%20the%20topics%20from%20a%20collection%20of%20text%20documents%2C%20using%20the%20KNIME%20Text%20Processing%20extension.%22%2C%22blogTitle%22%3A%22KNIME%22%2C%22date%22%3A%222017%22%2C%22url%22%3A%22https%3A%5C%2F%5C%2Fwww.knime.com%5C%2Fblog%5C%2Ftopic-extraction-optimizing-the-number-of-topics-with-the-elbow-method%22%2C%22language%22%3A%22en%22%2C%22collections%22%3A%5B%5D%2C%22dateModified%22%3A%222021-01-25T06%3A49%3A37Z%22%2C%22tags%22%3A%5B%7B%22tag%22%3A%22Topic%20model%20optimization%22%7D%2C%7B%22tag%22%3A%22Topic%20modeling%22%7D%5D%7D%7D%2C%7B%22key%22%3A%2264R5TQDV%22%2C%22library%22%3A%7B%22id%22%3A2133649%7D%2C%22meta%22%3A%7B%22lastModifiedByUser%22%3A%7B%22id%22%3A22837%2C%22username%22%3A%22ayliu%22%2C%22name%22%3A%22Alan%20Liu%22%2C%22links%22%3A%7B%22alternate%22%3A%7B%22href%22%3A%22https%3A%5C%2F%5C%2Fwww.zotero.org%5C%2Fayliu%22%2C%22type%22%3A%22text%5C%2Fhtml%22%7D%7D%7D%2C%22creatorSummary%22%3A%22Ellis%22%2C%22parsedDate%22%3A%222017%22%2C%22numChildren%22%3A0%7D%2C%22bib%22%3A%22%26lt%3Bdiv%20class%3D%26quot%3Bcsl-bib-body%26quot%3B%20style%3D%26quot%3Bline-height%3A%201.35%3B%20padding-left%3A%201em%3B%20text-indent%3A-1em%3B%26quot%3B%26gt%3B%5Cn%20%20%26lt%3Bdiv%20class%3D%26quot%3Bcsl-entry%26quot%3B%26gt%3BEllis%2C%20Peter.%20%26lt%3Bi%26gt%3BCross-Validation%20of%20Topic%20Modelling%26lt%3B%5C%2Fi%26gt%3B%2C%202017.%20%26lt%3Ba%20class%3D%26%23039%3Bzp-ItemURL%26%23039%3B%20href%3D%26%23039%3Bhttp%3A%5C%2F%5C%2Ffreerangestats.info%5C%2Fblog%5C%2F2017%5C%2F01%5C%2F05%5C%2Ftopic-model-cv.html%26%23039%3B%26gt%3Bhttp%3A%5C%2F%5C%2Ffreerangestats.info%5C%2Fblog%5C%2F2017%5C%2F01%5C%2F05%5C%2Ftopic-model-cv.html%26lt%3B%5C%2Fa%26gt%3B.%20%26lt%3Ba%20title%3D%26%23039%3BCite%20in%20RIS%20Format%26%23039%3B%20class%3D%26%23039%3Bzp-CiteRIS%26%23039%3B%20data-zp-cite%3D%26%23039%3Bapi_user_id%3D2133649%26amp%3Bitem_key%3D64R5TQDV%26%23039%3B%20href%3D%26%23039%3Bjavascript%3Avoid%280%29%3B%26%23039%3B%26gt%3BCite%26lt%3B%5C%2Fa%26gt%3B%20%26lt%3B%5C%2Fdiv%26gt%3B%5Cn%26lt%3B%5C%2Fdiv%26gt%3B%22%2C%22data%22%3A%7B%22itemType%22%3A%22book%22%2C%22title%22%3A%22Cross-validation%20of%20topic%20modelling%22%2C%22creators%22%3A%5B%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Peter%22%2C%22lastName%22%3A%22Ellis%22%7D%5D%2C%22abstractNote%22%3A%22Cross-validation%20of%20the%20%26quot%3Bperplexity%26quot%3B%20from%20a%20topic%20model%2C%20to%20help%20determine%20a%20good%20number%20of%20topics.%22%2C%22date%22%3A%222017%22%2C%22language%22%3A%22en%22%2C%22ISBN%22%3A%22%22%2C%22url%22%3A%22http%3A%5C%2F%5C%2Ffreerangestats.info%5C%2Fblog%5C%2F2017%5C%2F01%5C%2F05%5C%2Ftopic-model-cv.html%22%2C%22collections%22%3A%5B%5D%2C%22dateModified%22%3A%222019-07-27T21%3A33%3A39Z%22%2C%22tags%22%3A%5B%7B%22tag%22%3A%22Topic%20model%20optimization%22%7D%2C%7B%22tag%22%3A%22Topic%20modeling%22%7D%5D%7D%7D%2C%7B%22key%22%3A%22H6JDPB9Q%22%2C%22library%22%3A%7B%22id%22%3A2133649%7D%2C%22meta%22%3A%7B%22lastModifiedByUser%22%3A%7B%22id%22%3A22837%2C%22username%22%3A%22ayliu%22%2C%22name%22%3A%22Alan%20Liu%22%2C%22links%22%3A%7B%22alternate%22%3A%7B%22href%22%3A%22https%3A%5C%2F%5C%2Fwww.zotero.org%5C%2Fayliu%22%2C%22type%22%3A%22text%5C%2Fhtml%22%7D%7D%7D%2C%22creatorSummary%22%3A%22Wisdom%22%2C%22parsedDate%22%3A%222017%22%2C%22numChildren%22%3A0%7D%2C%22bib%22%3A%22%26lt%3Bdiv%20class%3D%26quot%3Bcsl-bib-body%26quot%3B%20style%3D%26quot%3Bline-height%3A%201.35%3B%20padding-left%3A%201em%3B%20text-indent%3A-1em%3B%26quot%3B%26gt%3B%5Cn%20%20%26lt%3Bdiv%20class%3D%26quot%3Bcsl-entry%26quot%3B%26gt%3BWisdom%2C%20Alyssa.%20%26lt%3Bi%26gt%3BTopic%20Modeling%26lt%3B%5C%2Fi%26gt%3B%2C%202017.%20%26lt%3Ba%20class%3D%26%23039%3Bzp-ItemURL%26%23039%3B%20href%3D%26%23039%3Bhttps%3A%5C%2F%5C%2Fmedium.com%5C%2Fsquare-corner-blog%5C%2Ftopic-modeling-optimizing-for-human-interpretability-48a81f6ce0ed%26%23039%3B%26gt%3Bhttps%3A%5C%2F%5C%2Fmedium.com%5C%2Fsquare-corner-blog%5C%2Ftopic-modeling-optimizing-for-human-interpretability-48a81f6ce0ed%26lt%3B%5C%2Fa%26gt%3B.%20%26lt%3Ba%20title%3D%26%23039%3BCite%20in%20RIS%20Format%26%23039%3B%20class%3D%26%23039%3Bzp-CiteRIS%26%23039%3B%20data-zp-cite%3D%26%23039%3Bapi_user_id%3D2133649%26amp%3Bitem_key%3DH6JDPB9Q%26%23039%3B%20href%3D%26%23039%3Bjavascript%3Avoid%280%29%3B%26%23039%3B%26gt%3BCite%26lt%3B%5C%2Fa%26gt%3B%20%26lt%3B%5C%2Fdiv%26gt%3B%5Cn%26lt%3B%5C%2Fdiv%26gt%3B%22%2C%22data%22%3A%7B%22itemType%22%3A%22book%22%2C%22title%22%3A%22Topic%20Modeling%22%2C%22creators%22%3A%5B%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Alyssa%22%2C%22lastName%22%3A%22Wisdom%22%7D%5D%2C%22abstractNote%22%3A%22Optimizing%20for%20Human%20Interpretability%22%2C%22date%22%3A%222017%22%2C%22language%22%3A%22en%22%2C%22ISBN%22%3A%22%22%2C%22url%22%3A%22https%3A%5C%2F%5C%2Fmedium.com%5C%2Fsquare-corner-blog%5C%2Ftopic-modeling-optimizing-for-human-interpretability-48a81f6ce0ed%22%2C%22collections%22%3A%5B%5D%2C%22dateModified%22%3A%222019-07-27T21%3A40%3A57Z%22%2C%22tags%22%3A%5B%7B%22tag%22%3A%22Topic%20model%20interpretation%22%7D%2C%7B%22tag%22%3A%22Topic%20model%20optimization%22%7D%2C%7B%22tag%22%3A%22Topic%20modeling%22%7D%5D%7D%7D%2C%7B%22key%22%3A%2229I95RIW%22%2C%22library%22%3A%7B%22id%22%3A2133649%7D%2C%22meta%22%3A%7B%22creatorSummary%22%3A%22Schofield%20et%20al.%22%2C%22parsedDate%22%3A%222017%22%2C%22numChildren%22%3A1%7D%2C%22bib%22%3A%22%26lt%3Bdiv%20class%3D%26quot%3Bcsl-bib-body%26quot%3B%20style%3D%26quot%3Bline-height%3A%201.35%3B%20padding-left%3A%201em%3B%20text-indent%3A-1em%3B%26quot%3B%26gt%3B%5Cn%20%20%26lt%3Bdiv%20class%3D%26quot%3Bcsl-entry%26quot%3B%26gt%3BSchofield%2C%20Alexandra%2C%20Laure%20Thompson%2C%20and%20David%20Mimno.%20%26%23x201C%3BQuantifying%20the%20Effects%20of%20Text%20Duplication%20on%20Semantic%20Models.%26%23x201D%3B%20In%20%26lt%3Bi%26gt%3BProceedings%20of%20the%202017%20Conference%20on%20Empirical%20Methods%20in%20Natural%20Language%20Processing%26lt%3B%5C%2Fi%26gt%3B%2C%202737%26%23x2013%3B47.%20Copenhagen%2C%20Denmark%3A%20Association%20for%20Computational%20Linguistics%2C%202017.%20%26lt%3Ba%20class%3D%26%23039%3Bzp-DOIURL%26%23039%3B%20href%3D%26%23039%3Bhttps%3A%5C%2F%5C%2Fdoi.org%5C%2F10.18653%5C%2Fv1%5C%2FD17-1290%26%23039%3B%26gt%3Bhttps%3A%5C%2F%5C%2Fdoi.org%5C%2F10.18653%5C%2Fv1%5C%2FD17-1290%26lt%3B%5C%2Fa%26gt%3B.%20%26lt%3Ba%20title%3D%26%23039%3BCite%20in%20RIS%20Format%26%23039%3B%20class%3D%26%23039%3Bzp-CiteRIS%26%23039%3B%20data-zp-cite%3D%26%23039%3Bapi_user_id%3D2133649%26amp%3Bitem_key%3D29I95RIW%26%23039%3B%20href%3D%26%23039%3Bjavascript%3Avoid%280%29%3B%26%23039%3B%26gt%3BCite%26lt%3B%5C%2Fa%26gt%3B%20%26lt%3B%5C%2Fdiv%26gt%3B%5Cn%26lt%3B%5C%2Fdiv%26gt%3B%22%2C%22data%22%3A%7B%22itemType%22%3A%22conferencePaper%22%2C%22title%22%3A%22Quantifying%20the%20Effects%20of%20Text%20Duplication%20on%20Semantic%20Models%22%2C%22creators%22%3A%5B%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Alexandra%22%2C%22lastName%22%3A%22Schofield%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Laure%22%2C%22lastName%22%3A%22Thompson%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22David%22%2C%22lastName%22%3A%22Mimno%22%7D%5D%2C%22abstractNote%22%3A%22Duplicate%20documents%20are%20a%20pervasive%20problem%20in%20text%20datasets%20and%20can%20have%20a%20strong%20effect%20on%20unsupervised%20models.%20Methods%20to%20remove%20duplicate%20texts%20are%20typically%20heuristic%20or%20very%20expensive%2C%20so%20it%20is%20vital%20to%20know%20when%20and%20why%20they%20are%20needed.%20We%20measure%20the%20sensitivity%20of%20two%20latent%20semantic%20methods%20to%20the%20presence%20of%20different%20levels%20of%20document%20repetition.%20By%20artificially%20creating%20different%20forms%20of%20duplicate%20text%20we%20confirm%20several%20hypotheses%20about%20how%20repeated%20text%20impacts%20models.%20While%20a%20small%20amount%20of%20duplication%20is%20tolerable%2C%20substantial%20over-representation%20of%20subsets%20of%20the%20text%20may%20overwhelm%20meaningful%20topical%20patterns.%22%2C%22date%22%3A%222017%22%2C%22proceedingsTitle%22%3A%22Proceedings%20of%20the%202017%20Conference%20on%20Empirical%20Methods%20in%20Natural%20Language%20Processing%22%2C%22conferenceName%22%3A%22Proceedings%20of%20the%202017%20Conference%20on%20Empirical%20Methods%20in%20Natural%20%20%20%20%20%20%20%20%20%20%20Language%20Processing%22%2C%22language%22%3A%22en%22%2C%22DOI%22%3A%2210.18653%5C%2Fv1%5C%2FD17-1290%22%2C%22ISBN%22%3A%22%22%2C%22url%22%3A%22http%3A%5C%2F%5C%2Faclweb.org%5C%2Fanthology%5C%2FD17-1290%22%2C%22collections%22%3A%5B%5D%2C%22dateModified%22%3A%222019-07-11T22%3A49%3A31Z%22%2C%22tags%22%3A%5B%7B%22tag%22%3A%22Topic%20model%20optimization%22%7D%2C%7B%22tag%22%3A%22Topic%20modeling%22%7D%5D%7D%7D%2C%7B%22key%22%3A%22IBL6BQ73%22%2C%22library%22%3A%7B%22id%22%3A2133649%7D%2C%22meta%22%3A%7B%22lastModifiedByUser%22%3A%7B%22id%22%3A22837%2C%22username%22%3A%22ayliu%22%2C%22name%22%3A%22Alan%20Liu%22%2C%22links%22%3A%7B%22alternate%22%3A%7B%22href%22%3A%22https%3A%5C%2F%5C%2Fwww.zotero.org%5C%2Fayliu%22%2C%22type%22%3A%22text%5C%2Fhtml%22%7D%7D%7D%2C%22creatorSummary%22%3A%22Sch%5Cu00f6ch%22%2C%22parsedDate%22%3A%222016%22%2C%22numChildren%22%3A0%7D%2C%22bib%22%3A%22%26lt%3Bdiv%20class%3D%26quot%3Bcsl-bib-body%26quot%3B%20style%3D%26quot%3Bline-height%3A%201.35%3B%20padding-left%3A%201em%3B%20text-indent%3A-1em%3B%26quot%3B%26gt%3B%5Cn%20%20%26lt%3Bdiv%20class%3D%26quot%3Bcsl-entry%26quot%3B%26gt%3BSch%26%23xF6%3Bch%2C%20Christof.%20%26lt%3Bi%26gt%3BTopic%20Modeling%20with%20MALLET%3A%20Hyperparameter%20Optimization%26lt%3B%5C%2Fi%26gt%3B%2C%202016.%20%26lt%3Ba%20class%3D%26%23039%3Bzp-ItemURL%26%23039%3B%20href%3D%26%23039%3Bhttps%3A%5C%2F%5C%2Fdragonfly.hypotheses.org%5C%2F1051%26%23039%3B%26gt%3Bhttps%3A%5C%2F%5C%2Fdragonfly.hypotheses.org%5C%2F1051%26lt%3B%5C%2Fa%26gt%3B.%20%26lt%3Ba%20title%3D%26%23039%3BCite%20in%20RIS%20Format%26%23039%3B%20class%3D%26%23039%3Bzp-CiteRIS%26%23039%3B%20data-zp-cite%3D%26%23039%3Bapi_user_id%3D2133649%26amp%3Bitem_key%3DIBL6BQ73%26%23039%3B%20href%3D%26%23039%3Bjavascript%3Avoid%280%29%3B%26%23039%3B%26gt%3BCite%26lt%3B%5C%2Fa%26gt%3B%20%26lt%3B%5C%2Fdiv%26gt%3B%5Cn%26lt%3B%5C%2Fdiv%26gt%3B%22%2C%22data%22%3A%7B%22itemType%22%3A%22book%22%2C%22title%22%3A%22Topic%20Modeling%20with%20MALLET%3A%20Hyperparameter%20Optimization%22%2C%22creators%22%3A%5B%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Christof%22%2C%22lastName%22%3A%22Sch%5Cu00f6ch%22%7D%5D%2C%22abstractNote%22%3A%22This%20is%20a%20short%20technical%20post%20about%20an%20interesting%20feature%20of%20Mallet%20which%20I%20have%20recently%20discovered%20or%20rather%2C%20whose%20%28for%20me%29%20unexpected%20effect%20on%20the%20topic%20models%20I%20have%20discovered%3A%20the%20parameter%20that%20controls%20the%20hyperparameter%20optimization%20interval%20in%20Mallet.%22%2C%22date%22%3A%222016%22%2C%22language%22%3A%22en%22%2C%22ISBN%22%3A%22%22%2C%22url%22%3A%22https%3A%5C%2F%5C%2Fdragonfly.hypotheses.org%5C%2F1051%22%2C%22collections%22%3A%5B%5D%2C%22dateModified%22%3A%222019-07-27T21%3A40%3A52Z%22%2C%22tags%22%3A%5B%7B%22tag%22%3A%22Topic%20model%20optimization%22%7D%2C%7B%22tag%22%3A%22Topic%20modeling%22%7D%5D%7D%7D%2C%7B%22key%22%3A%22LXYQTF2R%22%2C%22library%22%3A%7B%22id%22%3A2133649%7D%2C%22meta%22%3A%7B%22lastModifiedByUser%22%3A%7B%22id%22%3A22837%2C%22username%22%3A%22ayliu%22%2C%22name%22%3A%22Alan%20Liu%22%2C%22links%22%3A%7B%22alternate%22%3A%7B%22href%22%3A%22https%3A%5C%2F%5C%2Fwww.zotero.org%5C%2Fayliu%22%2C%22type%22%3A%22text%5C%2Fhtml%22%7D%7D%7D%2C%22creatorSummary%22%3A%22Soltoff%22%2C%22parsedDate%22%3A%222016%22%2C%22numChildren%22%3A0%7D%2C%22bib%22%3A%22%26lt%3Bdiv%20class%3D%26quot%3Bcsl-bib-body%26quot%3B%20style%3D%26quot%3Bline-height%3A%201.35%3B%20padding-left%3A%201em%3B%20text-indent%3A-1em%3B%26quot%3B%26gt%3B%5Cn%20%20%26lt%3Bdiv%20class%3D%26quot%3Bcsl-entry%26quot%3B%26gt%3BSoltoff%2C%20Benjamin.%20%26lt%3Bi%26gt%3BText%20Analysis%3A%20Topic%20Modeling%26lt%3B%5C%2Fi%26gt%3B%2C%202016.%20%26lt%3Ba%20class%3D%26%23039%3Bzp-ItemURL%26%23039%3B%20href%3D%26%23039%3Bhttps%3A%5C%2F%5C%2Fcfss.uchicago.edu%5C%2Ffall2016%5C%2Ftext02.html%26%23039%3B%26gt%3Bhttps%3A%5C%2F%5C%2Fcfss.uchicago.edu%5C%2Ffall2016%5C%2Ftext02.html%26lt%3B%5C%2Fa%26gt%3B.%20%26lt%3Ba%20title%3D%26%23039%3BCite%20in%20RIS%20Format%26%23039%3B%20class%3D%26%23039%3Bzp-CiteRIS%26%23039%3B%20data-zp-cite%3D%26%23039%3Bapi_user_id%3D2133649%26amp%3Bitem_key%3DLXYQTF2R%26%23039%3B%20href%3D%26%23039%3Bjavascript%3Avoid%280%29%3B%26%23039%3B%26gt%3BCite%26lt%3B%5C%2Fa%26gt%3B%20%26lt%3B%5C%2Fdiv%26gt%3B%5Cn%26lt%3B%5C%2Fdiv%26gt%3B%22%2C%22data%22%3A%7B%22itemType%22%3A%22book%22%2C%22title%22%3A%22Text%20analysis%3A%20topic%20modeling%22%2C%22creators%22%3A%5B%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Benjamin%22%2C%22lastName%22%3A%22Soltoff%22%7D%5D%2C%22abstractNote%22%3A%22Explanation%20of%20LDA%20topic%20modeling%20for%20a%202016%20course%20titled%20%26quot%3BComputing%20for%20the%20Social%20Sciences.%26quot%3B%20Includes%20intutive%20introductory%20examples%20as%20well%20as%20introductions%20to%20the%20mathematical%20logic%20involved.%20Concludes%20with%20discussion%20of%20optimizing%20the%20number%20of%20topics%20in%20a%20topic%20model%20by%20using%20%26quot%3Bperplexity%26quot%3B%20measures.%22%2C%22date%22%3A%222016%22%2C%22language%22%3A%22en%22%2C%22ISBN%22%3A%22%22%2C%22url%22%3A%22https%3A%5C%2F%5C%2Fcfss.uchicago.edu%5C%2Ffall2016%5C%2Ftext02.html%22%2C%22collections%22%3A%5B%5D%2C%22dateModified%22%3A%222019-07-27T21%3A40%3A54Z%22%2C%22tags%22%3A%5B%7B%22tag%22%3A%22Topic%20model%20introductions%20and%20tutorials%22%7D%2C%7B%22tag%22%3A%22Topic%20model%20optimization%22%7D%2C%7B%22tag%22%3A%22Topic%20modeling%22%7D%5D%7D%7D%2C%7B%22key%22%3A%22THIFIFS9%22%2C%22library%22%3A%7B%22id%22%3A2133649%7D%2C%22meta%22%3A%7B%22creatorSummary%22%3A%22Allahyari%20and%20Kochut%22%2C%22parsedDate%22%3A%222016%22%2C%22numChildren%22%3A0%7D%2C%22bib%22%3A%22%26lt%3Bdiv%20class%3D%26quot%3Bcsl-bib-body%26quot%3B%20style%3D%26quot%3Bline-height%3A%201.35%3B%20padding-left%3A%201em%3B%20text-indent%3A-1em%3B%26quot%3B%26gt%3B%5Cn%20%20%26lt%3Bdiv%20class%3D%26quot%3Bcsl-entry%26quot%3B%26gt%3BAllahyari%2C%20Mehdi%2C%20and%20Krys%20Kochut.%20%26%23x201C%3BDiscovering%20Coherent%20Topics%20with%20Entity%20Topic%20Models.%26%23x201D%3B%20In%20%26lt%3Bi%26gt%3B2016%20IEEE%5C%2FWIC%5C%2FACM%20International%20Conference%20on%20Web%20Intelligence%20%28WI%29%26lt%3B%5C%2Fi%26gt%3B%2C%2026%26%23x2013%3B33.%20Omaha%2C%20NE%2C%20USA%3A%20IEEE%2C%202016.%20%26lt%3Ba%20class%3D%26%23039%3Bzp-DOIURL%26%23039%3B%20href%3D%26%23039%3Bhttps%3A%5C%2F%5C%2Fdoi.org%5C%2F10.1109%5C%2FWI.2016.0015%26%23039%3B%26gt%3Bhttps%3A%5C%2F%5C%2Fdoi.org%5C%2F10.1109%5C%2FWI.2016.0015%26lt%3B%5C%2Fa%26gt%3B.%20%26lt%3Ba%20title%3D%26%23039%3BCite%20in%20RIS%20Format%26%23039%3B%20class%3D%26%23039%3Bzp-CiteRIS%26%23039%3B%20data-zp-cite%3D%26%23039%3Bapi_user_id%3D2133649%26amp%3Bitem_key%3DTHIFIFS9%26%23039%3B%20href%3D%26%23039%3Bjavascript%3Avoid%280%29%3B%26%23039%3B%26gt%3BCite%26lt%3B%5C%2Fa%26gt%3B%20%26lt%3B%5C%2Fdiv%26gt%3B%5Cn%26lt%3B%5C%2Fdiv%26gt%3B%22%2C%22data%22%3A%7B%22itemType%22%3A%22conferencePaper%22%2C%22title%22%3A%22Discovering%20Coherent%20Topics%20with%20Entity%20Topic%20Models%22%2C%22creators%22%3A%5B%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Mehdi%22%2C%22lastName%22%3A%22Allahyari%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Krys%22%2C%22lastName%22%3A%22Kochut%22%7D%5D%2C%22abstractNote%22%3A%22Probabilistic%20topic%20models%20are%20powerful%20techniques%20which%20are%20widely%20used%20for%20discovering%20topics%20or%20semantic%20content%20from%20a%20large%20collection%20of%20documents.%20However%2C%20because%20topic%20models%20are%20entirely%20unsupervised%2C%20they%20may%20lead%20to%20topics%20that%20are%20not%20understandable%20in%20applications.%20Recently%2C%20several%20knowledge-based%20topic%20models%20have%20been%20proposed%20which%20primarily%20use%20word-level%20domain%20knowledge%20in%20the%20model%20to%20enhance%20the%20topic%20coherence%20and%20ignore%20the%20rich%20information%20carried%20by%20entities%20%28e.g%20persons%2C%20location%2C%20organizations%2C%20etc.%29%20associated%20with%20the%20documents.%20Additionally%2C%20there%20exists%20a%20vast%20amount%20of%20prior%20knowledge%20%28background%20knowledge%29%20represented%20as%20ontologies%20and%20Linked%20Open%20Data%20%28LOD%29%2C%20which%20can%20be%20incorporated%20into%20the%20topic%20models%20to%20produce%20coherent%20topics.%20In%20this%20paper%2C%20we%20introduce%20a%20novel%20entity-based%20topic%20model%2C%20called%20EntLDA%2C%20to%20effectively%20integrate%20an%20ontology%20with%20an%20entity%20topic%20model%20to%20improve%20the%20topic%20modeling%20process.%20Furthermore%2C%20to%20increase%20the%20coherence%20of%20the%20identified%20topics%2C%20we%20introduce%20a%20novel%20ontology-based%20regularization%20framework%2C%20which%20is%20then%20integrated%20with%20the%20EntLDA%20model.%20Our%20experimental%20results%20demonstrate%20the%20effectiveness%20of%20the%20proposed%20model%20in%20improving%20the%20coherence%20of%20the%20topics.%22%2C%22date%22%3A%222016%22%2C%22proceedingsTitle%22%3A%222016%20IEEE%5C%2FWIC%5C%2FACM%20International%20Conference%20on%20Web%20Intelligence%20%28WI%29%22%2C%22conferenceName%22%3A%222016%20IEEE%5C%2FWIC%5C%2FACM%20International%20Conference%20on%20Web%20Intelligence%20%28WI%29%22%2C%22language%22%3A%22en%22%2C%22DOI%22%3A%2210.1109%5C%2FWI.2016.0015%22%2C%22ISBN%22%3A%22978-1-5090-4470-2%22%2C%22url%22%3A%22http%3A%5C%2F%5C%2Fieeexplore.ieee.org%5C%2Fdocument%5C%2F7817032%5C%2F%22%2C%22collections%22%3A%5B%5D%2C%22dateModified%22%3A%222019-07-27T06%3A25%3A49Z%22%2C%22tags%22%3A%5B%7B%22tag%22%3A%22Topic%20model%20optimization%22%7D%2C%7B%22tag%22%3A%22Topic%20modeling%22%7D%5D%7D%7D%2C%7B%22key%22%3A%223RRGTBRT%22%2C%22library%22%3A%7B%22id%22%3A2133649%7D%2C%22meta%22%3A%7B%22creatorSummary%22%3A%22Alexander%20and%20Gleicher%22%2C%22parsedDate%22%3A%222016%22%2C%22numChildren%22%3A0%7D%2C%22bib%22%3A%22%26lt%3Bdiv%20class%3D%26quot%3Bcsl-bib-body%26quot%3B%20style%3D%26quot%3Bline-height%3A%201.35%3B%20padding-left%3A%201em%3B%20text-indent%3A-1em%3B%26quot%3B%26gt%3B%5Cn%20%20%26lt%3Bdiv%20class%3D%26quot%3Bcsl-entry%26quot%3B%26gt%3BAlexander%2C%20Eric%2C%20and%20Michael%20Gleicher.%20%26%23x201C%3BTask-Driven%20Comparison%20of%20Topic%20Models.%26%23x201D%3B%20%26lt%3Bi%26gt%3BIEEE%20Transactions%20on%20Visualization%20and%20Computer%20Graphics%26lt%3B%5C%2Fi%26gt%3B%2022%2C%20no.%201%20%282016%29%3A%20320%26%23x2013%3B29.%20%26lt%3Ba%20class%3D%26%23039%3Bzp-DOIURL%26%23039%3B%20href%3D%26%23039%3Bhttps%3A%5C%2F%5C%2Fdoi.org%5C%2F10.1109%5C%2FTVCG.2015.2467618%26%23039%3B%26gt%3Bhttps%3A%5C%2F%5C%2Fdoi.org%5C%2F10.1109%5C%2FTVCG.2015.2467618%26lt%3B%5C%2Fa%26gt%3B.%20%26lt%3Ba%20title%3D%26%23039%3BCite%20in%20RIS%20Format%26%23039%3B%20class%3D%26%23039%3Bzp-CiteRIS%26%23039%3B%20data-zp-cite%3D%26%23039%3Bapi_user_id%3D2133649%26amp%3Bitem_key%3D3RRGTBRT%26%23039%3B%20href%3D%26%23039%3Bjavascript%3Avoid%280%29%3B%26%23039%3B%26gt%3BCite%26lt%3B%5C%2Fa%26gt%3B%20%26lt%3B%5C%2Fdiv%26gt%3B%5Cn%26lt%3B%5C%2Fdiv%26gt%3B%22%2C%22data%22%3A%7B%22itemType%22%3A%22journalArticle%22%2C%22title%22%3A%22Task-Driven%20Comparison%20of%20Topic%20Models%22%2C%22creators%22%3A%5B%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Eric%22%2C%22lastName%22%3A%22Alexander%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Michael%22%2C%22lastName%22%3A%22Gleicher%22%7D%5D%2C%22abstractNote%22%3A%22Topic%20modeling%2C%20a%20method%20of%20statistically%20extracting%20thematic%20content%20from%20a%20large%20collection%20of%20texts%2C%20is%20used%20for%20a%20wide%20variety%20of%20tasks%20within%20text%20analysis.%20Though%20there%20are%20a%20growing%20number%20of%20tools%20and%20techniques%20for%20exploring%20single%20models%2C%20comparisons%20between%20models%20are%20generally%20reduced%20to%20a%20small%20set%20of%20numerical%20metrics.%20These%20metrics%20may%20or%20may%20not%20reflect%20a%20model%26%23039%3Bs%20performance%20on%20the%20analyst%26%23039%3Bs%20intended%20task%2C%20and%20can%20therefore%20be%20insufficient%20to%20diagnose%20what%20causes%20differences%20between%20models.%20In%20this%20paper%2C%20we%20explore%20task-centric%20topic%20model%20comparison%2C%20considering%20how%20we%20can%20both%20provide%20detail%20for%20a%20more%20nuanced%20understanding%20of%20differences%20and%20address%20the%20wealth%20of%20tasks%20for%20which%20topic%20models%20are%20used.%20We%20derive%20comparison%20tasks%20from%20single-model%20uses%20of%20topic%20models%2C%20which%20predominantly%20fall%20into%20the%20categories%20of%20understanding%20topics%2C%20understanding%20similarity%2C%20and%20understanding%20change.%20Finally%2C%20we%20provide%20several%20visualization%20techniques%20that%20facilitate%20these%20tasks%2C%20including%20buddy%20plots%2C%20which%20combine%20color%20and%20position%20encodings%20to%20allow%20analysts%20to%20readily%20view%20changes%20in%20document%20similarity.%22%2C%22date%22%3A%222016%22%2C%22language%22%3A%22en%22%2C%22DOI%22%3A%2210.1109%5C%2FTVCG.2015.2467618%22%2C%22ISSN%22%3A%221077-2626%22%2C%22url%22%3A%22http%3A%5C%2F%5C%2Fieeexplore.ieee.org%5C%2Fdocument%5C%2F7194832%5C%2F%22%2C%22collections%22%3A%5B%5D%2C%22dateModified%22%3A%222020-01-03T19%3A57%3A08Z%22%2C%22tags%22%3A%5B%7B%22tag%22%3A%22Interpretability%20and%20explainability%22%7D%2C%7B%22tag%22%3A%22Topic%20model%20interpretation%22%7D%2C%7B%22tag%22%3A%22Topic%20model%20optimization%22%7D%5D%7D%7D%2C%7B%22key%22%3A%22HHN5NDJ6%22%2C%22library%22%3A%7B%22id%22%3A2133649%7D%2C%22meta%22%3A%7B%22lastModifiedByUser%22%3A%7B%22id%22%3A22837%2C%22username%22%3A%22ayliu%22%2C%22name%22%3A%22Alan%20Liu%22%2C%22links%22%3A%7B%22alternate%22%3A%7B%22href%22%3A%22https%3A%5C%2F%5C%2Fwww.zotero.org%5C%2Fayliu%22%2C%22type%22%3A%22text%5C%2Fhtml%22%7D%7D%7D%2C%22creatorSummary%22%3A%22Murdock%20and%20Allen%22%2C%22parsedDate%22%3A%222015%22%2C%22numChildren%22%3A0%7D%2C%22bib%22%3A%22%26lt%3Bdiv%20class%3D%26quot%3Bcsl-bib-body%26quot%3B%20style%3D%26quot%3Bline-height%3A%201.35%3B%20padding-left%3A%201em%3B%20text-indent%3A-1em%3B%26quot%3B%26gt%3B%5Cn%20%20%26lt%3Bdiv%20class%3D%26quot%3Bcsl-entry%26quot%3B%26gt%3BMurdock%2C%20Jaimie%2C%20and%20Colin%20Allen.%20%26%23x201C%3BVisualization%20Techniques%20for%20Topic%20Model%20Checking.%26%23x201D%3B%20In%20%26lt%3Bi%26gt%3BProceedings%20of%20the%20Twenty-Ninth%20AAAI%20Conference%20on%20Artificial%20Intelligence%26lt%3B%5C%2Fi%26gt%3B%2C%204284%26%23x2013%3B85.%20AAAI%26%23x2019%3B15.%20Austin%2C%20Texas%3A%20AAAI%20Press%2C%202015.%20%26lt%3Ba%20class%3D%26%23039%3Bzp-ItemURL%26%23039%3B%20href%3D%26%23039%3Bhttp%3A%5C%2F%5C%2Fdl.acm.org%5C%2Fcitation.cfm%3Fid%3D2888116.2888368%26%23039%3B%26gt%3Bhttp%3A%5C%2F%5C%2Fdl.acm.org%5C%2Fcitation.cfm%3Fid%3D2888116.2888368%26lt%3B%5C%2Fa%26gt%3B.%20%26lt%3Ba%20title%3D%26%23039%3BCite%20in%20RIS%20Format%26%23039%3B%20class%3D%26%23039%3Bzp-CiteRIS%26%23039%3B%20data-zp-cite%3D%26%23039%3Bapi_user_id%3D2133649%26amp%3Bitem_key%3DHHN5NDJ6%26%23039%3B%20href%3D%26%23039%3Bjavascript%3Avoid%280%29%3B%26%23039%3B%26gt%3BCite%26lt%3B%5C%2Fa%26gt%3B%20%26lt%3B%5C%2Fdiv%26gt%3B%5Cn%26lt%3B%5C%2Fdiv%26gt%3B%22%2C%22data%22%3A%7B%22itemType%22%3A%22conferencePaper%22%2C%22title%22%3A%22Visualization%20Techniques%20for%20Topic%20Model%20Checking%22%2C%22creators%22%3A%5B%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Jaimie%22%2C%22lastName%22%3A%22Murdock%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Colin%22%2C%22lastName%22%3A%22Allen%22%7D%5D%2C%22abstractNote%22%3A%22Topic%20models%20remain%20a%20black%20box%20both%20for%20modelers%20and%20for%20end%20users%20in%20many%20respects.%20From%20the%20modelers%26%23039%3B%20perspective%2C%20many%20decisions%20must%20be%20made%20which%20lack%20clear%20rationales%20and%20whose%20interactions%20are%20unclear.%20Furthermore%2C%20the%20results%20of%20different%20parameter%20settings%20are%20hard%20to%20analyze%2C%20summarize%2C%20and%20visualize%2C%20making%20model%20comparison%20difficult.%20From%20the%20end%20users%26%23039%3B%20perspective%2C%20it%20is%20hard%20to%20understand%20why%20the%20models%20perform%20as%20they%20do%2C%20and%20information-theoretic%20similarity%20measures%20do%20not%20fully%20align%20with%20humanistic%20interpretation%20of%20the%20topics.%20The%20authors%20present%20the%20Topic%20Explorer%2C%20which%20advances%20the%20state-of-the-art%20in%20topic%20model%20visualization%20for%20document-document%20and%20topic-document%20relations.%20It%20brings%20topic%20models%20to%20life%20in%20a%20way%20that%20fosters%20deep%20understanding%20of%20both%20corpus%20and%20models%2C%20allowing%20users%20to%20generate%20interpretive%20hypotheses%20and%20to%20suggest%20further%20experiments.%20Such%20tools%20are%20an%20essential%20step%20toward%20assessing%20whether%20topic%20modeling%20is%20a%20suitable%20technique%20for%20AI%20and%20cognitive%20modeling%20applications.%22%2C%22date%22%3A%222015%22%2C%22proceedingsTitle%22%3A%22Proceedings%20of%20the%20Twenty-Ninth%20AAAI%20Conference%20on%20Artificial%20Intelligence%22%2C%22conferenceName%22%3A%22%22%2C%22language%22%3A%22en%22%2C%22DOI%22%3A%22%22%2C%22ISBN%22%3A%22978-0-262-51129-2%22%2C%22url%22%3A%22http%3A%5C%2F%5C%2Fdl.acm.org%5C%2Fcitation.cfm%3Fid%3D2888116.2888368%22%2C%22collections%22%3A%5B%5D%2C%22dateModified%22%3A%222019-07-27T21%3A47%3A49Z%22%2C%22tags%22%3A%5B%7B%22tag%22%3A%22Topic%20model%20optimization%22%7D%2C%7B%22tag%22%3A%22Topic%20model%20visualization%22%7D%2C%7B%22tag%22%3A%22Topic%20modeling%22%7D%5D%7D%7D%2C%7B%22key%22%3A%22HMW4AWR2%22%2C%22library%22%3A%7B%22id%22%3A2133649%7D%2C%22meta%22%3A%7B%22lastModifiedByUser%22%3A%7B%22id%22%3A22837%2C%22username%22%3A%22ayliu%22%2C%22name%22%3A%22Alan%20Liu%22%2C%22links%22%3A%7B%22alternate%22%3A%7B%22href%22%3A%22https%3A%5C%2F%5C%2Fwww.zotero.org%5C%2Fayliu%22%2C%22type%22%3A%22text%5C%2Fhtml%22%7D%7D%7D%2C%22creatorSummary%22%3A%22Chuang%20et%20al.%22%2C%22parsedDate%22%3A%222015%22%2C%22numChildren%22%3A2%7D%2C%22bib%22%3A%22%26lt%3Bdiv%20class%3D%26quot%3Bcsl-bib-body%26quot%3B%20style%3D%26quot%3Bline-height%3A%201.35%3B%20padding-left%3A%201em%3B%20text-indent%3A-1em%3B%26quot%3B%26gt%3B%5Cn%20%20%26lt%3Bdiv%20class%3D%26quot%3Bcsl-entry%26quot%3B%26gt%3BChuang%2C%20Jason%2C%20Margaret%20E.%20Roberts%2C%20Brandon%20M.%20Stewart%2C%20Rebecca%20Weiss%2C%20Dustin%20Tingley%2C%20Justin%20Grimmer%2C%20and%20Jeffrey%20Heer.%20%26%23x201C%3BTopicCheck%3A%20Interactive%20Alignment%20for%20Assessing%20Topic%20Model%20Stability.%26%23x201D%3B%20In%20%26lt%3Bi%26gt%3BProceedings%20of%20the%202015%20Conference%20of%20the%20North%20American%20Chapter%20of%20the%20Association%20for%20Computational%20Linguistics%3A%20Human%20Language%20Technologies%26lt%3B%5C%2Fi%26gt%3B%2C%20175%26%23x2013%3B84.%20Denver%3A%20Association%20for%20Computational%20Linguistics%2C%202015.%20%26lt%3Ba%20class%3D%26%23039%3Bzp-DOIURL%26%23039%3B%20href%3D%26%23039%3Bhttps%3A%5C%2F%5C%2Fdoi.org%5C%2F10.3115%5C%2Fv1%5C%2FN15-1018%26%23039%3B%26gt%3Bhttps%3A%5C%2F%5C%2Fdoi.org%5C%2F10.3115%5C%2Fv1%5C%2FN15-1018%26lt%3B%5C%2Fa%26gt%3B.%20%26lt%3Ba%20title%3D%26%23039%3BCite%20in%20RIS%20Format%26%23039%3B%20class%3D%26%23039%3Bzp-CiteRIS%26%23039%3B%20data-zp-cite%3D%26%23039%3Bapi_user_id%3D2133649%26amp%3Bitem_key%3DHMW4AWR2%26%23039%3B%20href%3D%26%23039%3Bjavascript%3Avoid%280%29%3B%26%23039%3B%26gt%3BCite%26lt%3B%5C%2Fa%26gt%3B%20%26lt%3B%5C%2Fdiv%26gt%3B%5Cn%26lt%3B%5C%2Fdiv%26gt%3B%22%2C%22data%22%3A%7B%22itemType%22%3A%22conferencePaper%22%2C%22title%22%3A%22TopicCheck%3A%20Interactive%20Alignment%20for%20Assessing%20Topic%20Model%20Stability%22%2C%22creators%22%3A%5B%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Jason%22%2C%22lastName%22%3A%22Chuang%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Margaret%20E.%22%2C%22lastName%22%3A%22Roberts%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Brandon%20M.%22%2C%22lastName%22%3A%22Stewart%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Rebecca%22%2C%22lastName%22%3A%22Weiss%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Dustin%22%2C%22lastName%22%3A%22Tingley%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Justin%22%2C%22lastName%22%3A%22Grimmer%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Jeffrey%22%2C%22lastName%22%3A%22Heer%22%7D%5D%2C%22abstractNote%22%3A%22This%20paper%20introduces%20TopicCheck%2C%20an%20interactive%20tool%20for%20assessing%20topic%20model%20stability.%20The%20contributions%20are%20threefold.%20First%2C%20from%20established%20guidelines%20on%20reproducible%20content%20analysis%2C%20it%20distills%20a%20set%20of%20design%20requirements%20on%20how%20to%20computationally%20assess%20the%20stability%20of%20an%20automated%20coding%20process.%20Second%2C%20it%20devises%20an%20interactive%20alignment%20algorithm%20for%20matching%20latent%20topics%20from%20multiple%20models%2C%20and%20enable%20sensitivity%20evaluation%20across%20a%20large%20number%20of%20models.%20Finally%2C%20it%20demonstrates%20that%20our%20tool%20enables%20social%20scientists%20to%20gain%20novel%20insights%20into%20three%20active%20research%20questions.%22%2C%22date%22%3A%222015%22%2C%22proceedingsTitle%22%3A%22Proceedings%20of%20the%202015%20Conference%20of%20the%20North%20American%20Chapter%20of%20the%20Association%20for%20Computational%20Linguistics%3A%20Human%20Language%20Technologies%22%2C%22conferenceName%22%3A%22%22%2C%22language%22%3A%22en%22%2C%22DOI%22%3A%2210.3115%5C%2Fv1%5C%2FN15-1018%22%2C%22ISBN%22%3A%22%22%2C%22url%22%3A%22http%3A%5C%2F%5C%2Faclweb.org%5C%2Fanthology%5C%2FN15-1018%22%2C%22collections%22%3A%5B%5D%2C%22dateModified%22%3A%222019-07-27T21%3A47%3A28Z%22%2C%22tags%22%3A%5B%7B%22tag%22%3A%22Topic%20model%20optimization%22%7D%2C%7B%22tag%22%3A%22Topic%20modeling%22%7D%5D%7D%7D%2C%7B%22key%22%3A%22RT69KGFV%22%2C%22library%22%3A%7B%22id%22%3A2133649%7D%2C%22meta%22%3A%7B%22creatorSummary%22%3A%22Boyd-Graber%20et%20al.%22%2C%22parsedDate%22%3A%222014%22%2C%22numChildren%22%3A0%7D%2C%22bib%22%3A%22%26lt%3Bdiv%20class%3D%26quot%3Bcsl-bib-body%26quot%3B%20style%3D%26quot%3Bline-height%3A%201.35%3B%20padding-left%3A%201em%3B%20text-indent%3A-1em%3B%26quot%3B%26gt%3B%5Cn%20%20%26lt%3Bdiv%20class%3D%26quot%3Bcsl-entry%26quot%3B%26gt%3BBoyd-Graber%2C%20Jordan%2C%20David%20Mimno%2C%20and%20David%20Newman.%20%26%23x201C%3BCare%20and%20Feeding%20of%20Topic%20Models%3A%20Problems%2C%20Diagnostics%2C%20and%20Improvements.%26%23x201D%3B%20Handbook%20of%20Mixed%20Membership%20Models%20and%20Their%20Applications%2C%202014.%20https%3A%5C%2F%5C%2Fdoi.org%5C%2F10.1201%5C%2Fb17520-21.%20%26lt%3Ba%20title%3D%26%23039%3BCite%20in%20RIS%20Format%26%23039%3B%20class%3D%26%23039%3Bzp-CiteRIS%26%23039%3B%20data-zp-cite%3D%26%23039%3Bapi_user_id%3D2133649%26amp%3Bitem_key%3DRT69KGFV%26%23039%3B%20href%3D%26%23039%3Bjavascript%3Avoid%280%29%3B%26%23039%3B%26gt%3BCite%26lt%3B%5C%2Fa%26gt%3B%20%26lt%3B%5C%2Fdiv%26gt%3B%5Cn%26lt%3B%5C%2Fdiv%26gt%3B%22%2C%22data%22%3A%7B%22itemType%22%3A%22webpage%22%2C%22title%22%3A%22Care%20and%20Feeding%20of%20Topic%20Models%3A%20Problems%2C%20Diagnostics%2C%20and%20Improvements%22%2C%22creators%22%3A%5B%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Jordan%22%2C%22lastName%22%3A%22Boyd-Graber%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22David%22%2C%22lastName%22%3A%22Mimno%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22David%22%2C%22lastName%22%3A%22Newman%22%7D%5D%2C%22abstractNote%22%3A%22Topic%20models%20are%20a%20versatile%20tool%20for%20understanding%20corpora%2C%20but%20they%20are%20not%20perfect.%20In%20this%20chapter%2C%20we%20describe%20the%20problems%20users%20often%20encounter%20when%20using%20topic%20models%20for%20the%20first%20time.%20We%20begin%20with%20the%20preprocessing%20choices%20users%20must%20make%20when%20creating%20a%20corpus%20for%20topic%20modeling%20for%20the%20first%20time%2C%20followed%20by%20options%20users%20have%20for%20running%20topic%20models.%20After%20a%20user%20has%20a%20topic%20model%20learned%20from%20data%2C%20we%20describe%20how%20users%20know%20whether%20they%20have%20a%20good%20topic%20model%20or%20not%20and%20give%20a%20summary%20of%20the%20common%20problems%20users%20have%2C%20and%20how%20those%20problems%20can%20be%20addressed%20and%20solved%20by%20recent%20advances%20in%20both%20models%20and%20tools.%22%2C%22date%22%3A%222014%22%2C%22url%22%3A%22https%3A%5C%2F%5C%2Fwww.semanticscholar.org%5C%2Fpaper%5C%2F1-Care-and-Feeding-of-Topic-Models-%253A-Problems-%252C-%252C-Mimno%5C%2F24d1e03fa7cd20edd646df8b2c6e43365f2faa10%22%2C%22language%22%3A%22en%22%2C%22collections%22%3A%5B%5D%2C%22dateModified%22%3A%222019-09-07T02%3A16%3A27Z%22%2C%22tags%22%3A%5B%7B%22tag%22%3A%22Topic%20model%20interpretation%22%7D%2C%7B%22tag%22%3A%22Topic%20model%20labeling%22%7D%2C%7B%22tag%22%3A%22Topic%20model%20optimization%22%7D%2C%7B%22tag%22%3A%22Topic%20modeling%22%7D%5D%7D%7D%2C%7B%22key%22%3A%22HML7NLEW%22%2C%22library%22%3A%7B%22id%22%3A2133649%7D%2C%22meta%22%3A%7B%22lastModifiedByUser%22%3A%7B%22id%22%3A22837%2C%22username%22%3A%22ayliu%22%2C%22name%22%3A%22Alan%20Liu%22%2C%22links%22%3A%7B%22alternate%22%3A%7B%22href%22%3A%22https%3A%5C%2F%5C%2Fwww.zotero.org%5C%2Fayliu%22%2C%22type%22%3A%22text%5C%2Fhtml%22%7D%7D%7D%2C%22creatorSummary%22%3A%22Sievert%20and%20Shirley%22%2C%22parsedDate%22%3A%222014%22%2C%22numChildren%22%3A0%7D%2C%22bib%22%3A%22%26lt%3Bdiv%20class%3D%26quot%3Bcsl-bib-body%26quot%3B%20style%3D%26quot%3Bline-height%3A%201.35%3B%20padding-left%3A%201em%3B%20text-indent%3A-1em%3B%26quot%3B%26gt%3B%5Cn%20%20%26lt%3Bdiv%20class%3D%26quot%3Bcsl-entry%26quot%3B%26gt%3BSievert%2C%20Carson%2C%20and%20Kenneth%20R.%20Shirley.%20%26%23x201C%3BLDAvis%26%23x202F%3B%3A%20A%20Method%20for%20Visualizing%20and%20Interpreting%20Topics.%26%23x201D%3B%20In%20%26lt%3Bi%26gt%3BProceedings%20of%20the%20Workshop%20on%20Interactive%20Language%20Learning%2C%20Visualization%2C%20and%20Interfaces%26lt%3B%5C%2Fi%26gt%3B%2C%2063%26%23x2013%3B70.%20Association%20for%20Computational%20Linguistics%2C%202014.%20%26lt%3Ba%20class%3D%26%23039%3Bzp-ItemURL%26%23039%3B%20href%3D%26%23039%3Bhttp%3A%5C%2F%5C%2Fwww.aclweb.org%5C%2Fanthology%5C%2FW14-3110%26%23039%3B%26gt%3Bhttp%3A%5C%2F%5C%2Fwww.aclweb.org%5C%2Fanthology%5C%2FW14-3110%26lt%3B%5C%2Fa%26gt%3B.%20%26lt%3Ba%20title%3D%26%23039%3BCite%20in%20RIS%20Format%26%23039%3B%20class%3D%26%23039%3Bzp-CiteRIS%26%23039%3B%20data-zp-cite%3D%26%23039%3Bapi_user_id%3D2133649%26amp%3Bitem_key%3DHML7NLEW%26%23039%3B%20href%3D%26%23039%3Bjavascript%3Avoid%280%29%3B%26%23039%3B%26gt%3BCite%26lt%3B%5C%2Fa%26gt%3B%20%26lt%3B%5C%2Fdiv%26gt%3B%5Cn%26lt%3B%5C%2Fdiv%26gt%3B%22%2C%22data%22%3A%7B%22itemType%22%3A%22conferencePaper%22%2C%22title%22%3A%22LDAvis%20%3A%20A%20method%20for%20visualizing%20and%20interpreting%20topics%22%2C%22creators%22%3A%5B%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Carson%22%2C%22lastName%22%3A%22Sievert%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Kenneth%20R.%22%2C%22lastName%22%3A%22Shirley%22%7D%5D%2C%22abstractNote%22%3A%22This%20paper%20presents%20LDAvis%2C%20a%20web-based%20interactive%20visualization%20of%20topics%20estimated%20using%20Latent%20Dirichlet%20Allocation%20that%20is%20built%20using%20a%20combination%20of%20R%20and%20D3.%20The%20visualization%20provides%20a%20global%20view%20of%20the%20topics%20%28and%20how%20they%20differ%20from%20each%20other%29%2C%20while%20at%20the%20same%20time%20allowing%20for%20a%20deep%20inspection%20of%20the%20terms%20most%20highly%20associated%20with%20each%20individual%20topic.%22%2C%22date%22%3A%222014%22%2C%22proceedingsTitle%22%3A%22Proceedings%20of%20the%20Workshop%20on%20Interactive%20Language%20Learning%2C%20Visualization%2C%20and%20Interfaces%22%2C%22conferenceName%22%3A%22%22%2C%22language%22%3A%22en%22%2C%22DOI%22%3A%22%22%2C%22ISBN%22%3A%22%22%2C%22url%22%3A%22http%3A%5C%2F%5C%2Fwww.aclweb.org%5C%2Fanthology%5C%2FW14-3110%22%2C%22collections%22%3A%5B%5D%2C%22dateModified%22%3A%222019-07-27T21%3A43%3A17Z%22%2C%22tags%22%3A%5B%7B%22tag%22%3A%22Topic%20model%20optimization%22%7D%2C%7B%22tag%22%3A%22Topic%20model%20visualization%22%7D%2C%7B%22tag%22%3A%22Topic%20modeling%22%7D%5D%7D%7D%2C%7B%22key%22%3A%22C9XXBIT5%22%2C%22library%22%3A%7B%22id%22%3A2133649%7D%2C%22meta%22%3A%7B%22lastModifiedByUser%22%3A%7B%22id%22%3A22837%2C%22username%22%3A%22ayliu%22%2C%22name%22%3A%22Alan%20Liu%22%2C%22links%22%3A%7B%22alternate%22%3A%7B%22href%22%3A%22https%3A%5C%2F%5C%2Fwww.zotero.org%5C%2Fayliu%22%2C%22type%22%3A%22text%5C%2Fhtml%22%7D%7D%7D%2C%22creatorSummary%22%3A%22Evans%22%2C%22parsedDate%22%3A%222014%22%2C%22numChildren%22%3A2%7D%2C%22bib%22%3A%22%26lt%3Bdiv%20class%3D%26quot%3Bcsl-bib-body%26quot%3B%20style%3D%26quot%3Bline-height%3A%201.35%3B%20padding-left%3A%201em%3B%20text-indent%3A-1em%3B%26quot%3B%26gt%3B%5Cn%20%20%26lt%3Bdiv%20class%3D%26quot%3Bcsl-entry%26quot%3B%26gt%3BEvans%2C%20Michael%20S.%20%26%23x201C%3BA%20Computational%20Approach%20to%20Qualitative%20Analysis%20in%20Large%20Textual%20Datasets.%26%23x201D%3B%20%26lt%3Bi%26gt%3BPLOS%20ONE%26lt%3B%5C%2Fi%26gt%3B%209%2C%20no.%202%20%282014%29%3A%20e87908.%20%26lt%3Ba%20class%3D%26%23039%3Bzp-DOIURL%26%23039%3B%20href%3D%26%23039%3Bhttps%3A%5C%2F%5C%2Fdoi.org%5C%2F10.1371%5C%2Fjournal.pone.0087908%26%23039%3B%26gt%3Bhttps%3A%5C%2F%5C%2Fdoi.org%5C%2F10.1371%5C%2Fjournal.pone.0087908%26lt%3B%5C%2Fa%26gt%3B.%20%26lt%3Ba%20title%3D%26%23039%3BCite%20in%20RIS%20Format%26%23039%3B%20class%3D%26%23039%3Bzp-CiteRIS%26%23039%3B%20data-zp-cite%3D%26%23039%3Bapi_user_id%3D2133649%26amp%3Bitem_key%3DC9XXBIT5%26%23039%3B%20href%3D%26%23039%3Bjavascript%3Avoid%280%29%3B%26%23039%3B%26gt%3BCite%26lt%3B%5C%2Fa%26gt%3B%20%26lt%3B%5C%2Fdiv%26gt%3B%5Cn%26lt%3B%5C%2Fdiv%26gt%3B%22%2C%22data%22%3A%7B%22itemType%22%3A%22journalArticle%22%2C%22title%22%3A%22A%20Computational%20Approach%20to%20Qualitative%20Analysis%20in%20Large%20Textual%20Datasets%22%2C%22creators%22%3A%5B%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Michael%20S.%22%2C%22lastName%22%3A%22Evans%22%7D%5D%2C%22abstractNote%22%3A%22In%20this%20paper%20I%20introduce%20computational%20techniques%20to%20extend%20qualitative%20analysis%20into%20the%20study%20of%20large%20textual%20datasets.%20I%20demonstrate%20these%20techniques%20by%20using%20probabilistic%20topic%20modeling%20to%20analyze%20a%20broad%20sample%20of%2014%2C952%20documents%20published%20in%20major%20American%20newspapers%20from%201980%20through%202012.%20I%20show%20how%20computational%20data%20mining%20techniques%20can%20identify%20and%20evaluate%20the%20significance%20of%20qualitatively%20distinct%20subjects%20of%20discussion%20across%20a%20wide%20range%20of%20public%20discourse.%20I%20also%20show%20how%20examining%20large%20textual%20datasets%20with%20computational%20methods%20can%20overcome%20methodological%20limitations%20of%20conventional%20qualitative%20methods%2C%20such%20as%20how%20to%20measure%20the%20impact%20of%20particular%20cases%20on%20broader%20discourse%2C%20how%20to%20validate%20substantive%20inferences%20from%20small%20samples%20of%20textual%20data%2C%20and%20how%20to%20determine%20if%20identified%20cases%20are%20part%20of%20a%20consistent%20temporal%20pattern.%22%2C%22date%22%3A%222014%22%2C%22language%22%3A%22en%22%2C%22DOI%22%3A%2210.1371%5C%2Fjournal.pone.0087908%22%2C%22ISSN%22%3A%221932-6203%22%2C%22url%22%3A%22http%3A%5C%2F%5C%2Fjournals.plos.org%5C%2Fplosone%5C%2Farticle%3Fid%3D10.1371%5C%2Fjournal.pone.0087908%22%2C%22collections%22%3A%5B%5D%2C%22dateModified%22%3A%222020-07-01T20%3A51%3A16Z%22%2C%22tags%22%3A%5B%7B%22tag%22%3A%22Topic%20model%20optimization%22%7D%2C%7B%22tag%22%3A%22Topic%20modeling%22%7D%5D%7D%7D%2C%7B%22key%22%3A%227NYTEB9R%22%2C%22library%22%3A%7B%22id%22%3A2133649%7D%2C%22meta%22%3A%7B%22lastModifiedByUser%22%3A%7B%22id%22%3A22837%2C%22username%22%3A%22ayliu%22%2C%22name%22%3A%22Alan%20Liu%22%2C%22links%22%3A%7B%22alternate%22%3A%7B%22href%22%3A%22https%3A%5C%2F%5C%2Fwww.zotero.org%5C%2Fayliu%22%2C%22type%22%3A%22text%5C%2Fhtml%22%7D%7D%7D%2C%22creatorSummary%22%3A%22Chuang%20et%20al.%22%2C%22parsedDate%22%3A%222013%22%2C%22numChildren%22%3A0%7D%2C%22bib%22%3A%22%26lt%3Bdiv%20class%3D%26quot%3Bcsl-bib-body%26quot%3B%20style%3D%26quot%3Bline-height%3A%201.35%3B%20padding-left%3A%201em%3B%20text-indent%3A-1em%3B%26quot%3B%26gt%3B%5Cn%20%20%26lt%3Bdiv%20class%3D%26quot%3Bcsl-entry%26quot%3B%26gt%3BChuang%2C%20Jason%2C%20Sonal%20Gupta%2C%20Christopher%20D.%20Manning%2C%20and%20Jeffrey%20Heer.%20%26%23x201C%3BTopic%20Model%20Diagnostics%3A%20Assessing%20Domain%20Relevance%20via%20Topical%20Alignment.%26%23x201D%3B%20In%20%26lt%3Bi%26gt%3BProceedings%20of%20the%2030th%20International%20Conference%20on%20International%20Conference%20on%20Machine%20Learning%20-%20Volume%2028%26lt%3B%5C%2Fi%26gt%3B%2C%20III-612-III%26%23x2013%3B620.%20ICML%26%23x2019%3B13.%20Atlanta%2C%20GA%2C%20USA%3A%20JMLR.org%2C%202013.%20%26lt%3Ba%20class%3D%26%23039%3Bzp-ItemURL%26%23039%3B%20href%3D%26%23039%3Bhttp%3A%5C%2F%5C%2Fdl.acm.org%5C%2Fcitation.cfm%3Fid%3D3042817.3043005%26%23039%3B%26gt%3Bhttp%3A%5C%2F%5C%2Fdl.acm.org%5C%2Fcitation.cfm%3Fid%3D3042817.3043005%26lt%3B%5C%2Fa%26gt%3B.%20%26lt%3Ba%20title%3D%26%23039%3BCite%20in%20RIS%20Format%26%23039%3B%20class%3D%26%23039%3Bzp-CiteRIS%26%23039%3B%20data-zp-cite%3D%26%23039%3Bapi_user_id%3D2133649%26amp%3Bitem_key%3D7NYTEB9R%26%23039%3B%20href%3D%26%23039%3Bjavascript%3Avoid%280%29%3B%26%23039%3B%26gt%3BCite%26lt%3B%5C%2Fa%26gt%3B%20%26lt%3B%5C%2Fdiv%26gt%3B%5Cn%26lt%3B%5C%2Fdiv%26gt%3B%22%2C%22data%22%3A%7B%22itemType%22%3A%22conferencePaper%22%2C%22title%22%3A%22Topic%20Model%20Diagnostics%3A%20Assessing%20Domain%20Relevance%20via%20Topical%20Alignment%22%2C%22creators%22%3A%5B%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Jason%22%2C%22lastName%22%3A%22Chuang%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Sonal%22%2C%22lastName%22%3A%22Gupta%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Christopher%20D.%22%2C%22lastName%22%3A%22Manning%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Jeffrey%22%2C%22lastName%22%3A%22Heer%22%7D%5D%2C%22abstractNote%22%3A%22This%20paper%20introduces%20a%20framework%20to%20support%20large-scale%20assessment%20of%20topical%20relevance.%20It%20measures%20the%20correspondence%20between%20a%20set%20of%20latent%20topics%20and%20a%20set%20of%20reference%20concepts%20to%20quantify%20four%20types%20of%20topical%20misalignment%3A%20junk%2C%20fused%2C%20missing%2C%20and%20repeated%20topics.%20The%20analysis%20compares%2010%2C000%20topic%20model%20variants%20to%20200%20expert-provided%20domain%20concepts%2C%20and%20demonstrates%20how%20our%20framework%20can%20inform%20choices%20of%20model%20parameters%2C%20inference%20algorithms%2C%20and%20intrinsic%20measures%20of%20topical%20quality.%22%2C%22date%22%3A%222013%22%2C%22proceedingsTitle%22%3A%22Proceedings%20of%20the%2030th%20International%20Conference%20on%20International%20Conference%20on%20Machine%20Learning%20-%20Volume%2028%22%2C%22conferenceName%22%3A%22%22%2C%22language%22%3A%22en%22%2C%22DOI%22%3A%22%22%2C%22ISBN%22%3A%22%22%2C%22url%22%3A%22http%3A%5C%2F%5C%2Fdl.acm.org%5C%2Fcitation.cfm%3Fid%3D3042817.3043005%22%2C%22collections%22%3A%5B%5D%2C%22dateModified%22%3A%222020-07-01T20%3A52%3A53Z%22%2C%22tags%22%3A%5B%7B%22tag%22%3A%22Topic%20model%20optimization%22%7D%2C%7B%22tag%22%3A%22Topic%20modeling%22%7D%5D%7D%7D%2C%7B%22key%22%3A%22I8P2RHHC%22%2C%22library%22%3A%7B%22id%22%3A2133649%7D%2C%22meta%22%3A%7B%22creatorSummary%22%3A%22Chen%20et%20al.%22%2C%22parsedDate%22%3A%222013%22%2C%22numChildren%22%3A0%7D%2C%22bib%22%3A%22%26lt%3Bdiv%20class%3D%26quot%3Bcsl-bib-body%26quot%3B%20style%3D%26quot%3Bline-height%3A%201.35%3B%20padding-left%3A%201em%3B%20text-indent%3A-1em%3B%26quot%3B%26gt%3B%5Cn%20%20%26lt%3Bdiv%20class%3D%26quot%3Bcsl-entry%26quot%3B%26gt%3BChen%2C%20Zhiyuan%2C%20Arjun%20Mukherjee%2C%20Bing%20Liu%2C%20Meichun%20Hsu%2C%20Malu%20Castellanos%2C%20and%20Riddhiman%20Ghosh.%20%26%23x201C%3BDiscovering%20Coherent%20Topics%20Using%20General%20Knowledge.%26%23x201D%3B%20In%20%26lt%3Bi%26gt%3BProceedings%20of%20the%2022Nd%20ACM%20International%20Conference%20on%20Information%20%26amp%3B%20Knowledge%20Management%26lt%3B%5C%2Fi%26gt%3B%2C%20209%26%23x2013%3B18.%20CIKM%20%26%23x2019%3B13.%20New%20York%2C%20NY%2C%20USA%3A%20ACM%2C%202013.%20%26lt%3Ba%20class%3D%26%23039%3Bzp-DOIURL%26%23039%3B%20href%3D%26%23039%3Bhttps%3A%5C%2F%5C%2Fdoi.org%5C%2F10.1145%5C%2F2505515.2505519%26%23039%3B%26gt%3Bhttps%3A%5C%2F%5C%2Fdoi.org%5C%2F10.1145%5C%2F2505515.2505519%26lt%3B%5C%2Fa%26gt%3B.%20%26lt%3Ba%20title%3D%26%23039%3BCite%20in%20RIS%20Format%26%23039%3B%20class%3D%26%23039%3Bzp-CiteRIS%26%23039%3B%20data-zp-cite%3D%26%23039%3Bapi_user_id%3D2133649%26amp%3Bitem_key%3DI8P2RHHC%26%23039%3B%20href%3D%26%23039%3Bjavascript%3Avoid%280%29%3B%26%23039%3B%26gt%3BCite%26lt%3B%5C%2Fa%26gt%3B%20%26lt%3B%5C%2Fdiv%26gt%3B%5Cn%26lt%3B%5C%2Fdiv%26gt%3B%22%2C%22data%22%3A%7B%22itemType%22%3A%22conferencePaper%22%2C%22title%22%3A%22Discovering%20Coherent%20Topics%20Using%20General%20Knowledge%22%2C%22creators%22%3A%5B%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Zhiyuan%22%2C%22lastName%22%3A%22Chen%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Arjun%22%2C%22lastName%22%3A%22Mukherjee%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Bing%22%2C%22lastName%22%3A%22Liu%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Meichun%22%2C%22lastName%22%3A%22Hsu%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Malu%22%2C%22lastName%22%3A%22Castellanos%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Riddhiman%22%2C%22lastName%22%3A%22Ghosh%22%7D%5D%2C%22abstractNote%22%3A%22Topic%20models%20have%20been%20widely%20used%20to%20discover%20latent%20topics%20in%20text%20documents.%20However%2C%20they%20may%20produce%20topics%20that%20are%20not%20interpretable%20for%20an%20application.%20Researchers%20have%20proposed%20to%20incorporate%20prior%20domain%20knowledge%20into%20topic%20models%20to%20help%20produce%20coherent%20topics.%20The%20knowledge%20used%20in%20existing%20models%20is%20typically%20domain%20dependent%20and%20assumed%20to%20be%20correct.%20However%2C%20one%20key%20weakness%20of%20this%20knowledge-based%20approach%20is%20that%20it%20requires%20the%20user%20to%20know%20the%20domain%20very%20well%20and%20to%20be%20able%20to%20provide%20knowledge%20suitable%20for%20the%20domain%2C%20which%20is%20not%20always%20the%20case%20because%20in%20most%20real-life%20applications%2C%20the%20user%20wants%20to%20find%20what%20they%20do%20not%20know.%20In%20this%20paper%2C%20we%20propose%20a%20framework%20to%20leverage%20the%20general%20knowledge%20in%20topic%20models.%20Such%20knowledge%20is%20domain%20independent.%20Specifically%2C%20we%20use%20one%20form%20of%20general%20knowledge%2C%20i.e.%2C%20lexical%20semantic%20relations%20of%20words%20such%20as%20synonyms%2C%20antonyms%20and%20adjective%20attributes%2C%20to%20help%20produce%20more%20coherent%20topics.%20However%2C%20there%20is%20a%20major%20obstacle%2C%20i.e.%2C%20a%20word%20can%20have%20multiple%20meanings%5C%2Fsenses%20and%20each%20meaning%20often%20has%20a%20different%20set%20of%20synonyms%20and%20antonyms.%20Not%20every%20meaning%20is%20suitable%20or%20correct%20for%20a%20domain.%20Wrong%20knowledge%20can%20result%20in%20poor%20quality%20topics.%20To%20deal%20with%20wrong%20knowledge%2C%20we%20propose%20a%20new%20model%2C%20called%20GK-LDA%2C%20which%20is%20able%20to%20effectively%20exploit%20the%20knowledge%20of%20lexical%20relations%20in%20dictionaries.%20To%20the%20best%20of%20our%20knowledge%2C%20GK-LDA%20is%20the%20first%20such%20model%20that%20can%20incorporate%20the%20domain%20independent%20knowledge.%20Our%20experiments%20using%20online%20product%20reviews%20show%20that%20GK-LDA%20performs%20significantly%20better%20than%20existing%20state-of-the-art%20models.%22%2C%22date%22%3A%222013%22%2C%22proceedingsTitle%22%3A%22Proceedings%20of%20the%2022Nd%20ACM%20International%20Conference%20on%20Information%20%26%20Knowledge%20Management%22%2C%22conferenceName%22%3A%22%22%2C%22language%22%3A%22en%22%2C%22DOI%22%3A%2210.1145%5C%2F2505515.2505519%22%2C%22ISBN%22%3A%22978-1-4503-2263-8%22%2C%22url%22%3A%22http%3A%5C%2F%5C%2Fdoi.acm.org%5C%2F10.1145%5C%2F2505515.2505519%22%2C%22collections%22%3A%5B%5D%2C%22dateModified%22%3A%222019-07-27T06%3A31%3A35Z%22%2C%22tags%22%3A%5B%7B%22tag%22%3A%22Topic%20model%20optimization%22%7D%2C%7B%22tag%22%3A%22Topic%20modeling%22%7D%5D%7D%7D%2C%7B%22key%22%3A%22P2TPVB22%22%2C%22library%22%3A%7B%22id%22%3A2133649%7D%2C%22meta%22%3A%7B%22lastModifiedByUser%22%3A%7B%22id%22%3A22837%2C%22username%22%3A%22ayliu%22%2C%22name%22%3A%22Alan%20Liu%22%2C%22links%22%3A%7B%22alternate%22%3A%7B%22href%22%3A%22https%3A%5C%2F%5C%2Fwww.zotero.org%5C%2Fayliu%22%2C%22type%22%3A%22text%5C%2Fhtml%22%7D%7D%7D%2C%22creatorSummary%22%3A%22Chuang%20et%20al.%22%2C%22parsedDate%22%3A%222012%22%2C%22numChildren%22%3A1%7D%2C%22bib%22%3A%22%26lt%3Bdiv%20class%3D%26quot%3Bcsl-bib-body%26quot%3B%20style%3D%26quot%3Bline-height%3A%201.35%3B%20padding-left%3A%201em%3B%20text-indent%3A-1em%3B%26quot%3B%26gt%3B%5Cn%20%20%26lt%3Bdiv%20class%3D%26quot%3Bcsl-entry%26quot%3B%26gt%3BChuang%2C%20Jason%2C%20Daniel%20Ramage%2C%20Christopher%20Manning%2C%20and%20Jeffrey%20Heer.%20%26%23x201C%3BInterpretation%20and%20Trust%3A%20Designing%20Model-Driven%20Visualizations%20for%20Text%20Analysis.%26%23x201D%3B%20In%20%26lt%3Bi%26gt%3BProceedings%20of%20the%20SIGCHI%20Conference%20on%20Human%20Factors%20in%20Computing%20Systems%26lt%3B%5C%2Fi%26gt%3B%2C%20443%26%23x2013%3B52.%20CHI%20%26%23x2019%3B12.%20New%20York%2C%20NY%2C%20USA%3A%20ACM%2C%202012.%20%26lt%3Ba%20class%3D%26%23039%3Bzp-DOIURL%26%23039%3B%20href%3D%26%23039%3Bhttps%3A%5C%2F%5C%2Fdoi.org%5C%2F10.1145%5C%2F2207676.2207738%26%23039%3B%26gt%3Bhttps%3A%5C%2F%5C%2Fdoi.org%5C%2F10.1145%5C%2F2207676.2207738%26lt%3B%5C%2Fa%26gt%3B.%20%26lt%3Ba%20title%3D%26%23039%3BCite%20in%20RIS%20Format%26%23039%3B%20class%3D%26%23039%3Bzp-CiteRIS%26%23039%3B%20data-zp-cite%3D%26%23039%3Bapi_user_id%3D2133649%26amp%3Bitem_key%3DP2TPVB22%26%23039%3B%20href%3D%26%23039%3Bjavascript%3Avoid%280%29%3B%26%23039%3B%26gt%3BCite%26lt%3B%5C%2Fa%26gt%3B%20%26lt%3B%5C%2Fdiv%26gt%3B%5Cn%26lt%3B%5C%2Fdiv%26gt%3B%22%2C%22data%22%3A%7B%22itemType%22%3A%22conferencePaper%22%2C%22title%22%3A%22Interpretation%20and%20Trust%3A%20Designing%20Model-driven%20Visualizations%20for%20Text%20Analysis%22%2C%22creators%22%3A%5B%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Jason%22%2C%22lastName%22%3A%22Chuang%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Daniel%22%2C%22lastName%22%3A%22Ramage%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Christopher%22%2C%22lastName%22%3A%22Manning%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Jeffrey%22%2C%22lastName%22%3A%22Heer%22%7D%5D%2C%22abstractNote%22%3A%22This%20paper%20offers%20two%20design%20considerations%20-%20interpretation%20and%20trust%20-%20for%20designing%20visualizations%20based%20on%20data-driven%20models.%20Interpretation%20refers%20to%20the%20facility%20with%20which%20an%20analyst%20makes%20inferences%20about%20the%20data%20through%20the%20lens%20of%20a%20model%20abstraction.%20Trust%20refers%20to%20the%20actual%20and%20perceived%20accuracy%20of%20an%20analyst%26%23039%3Bs%20inferences.%20These%20considerations%20derive%20from%20experiences%20in%20developing%20the%20Stanford%20Dissertation%20Browser%2C%20a%20tool%20for%20exploring%20over%209%2C000%20Ph.D.%20theses%20by%20topical%20similarity%2C%20and%20a%20subsequent%20review%20of%20existing%20literature.%22%2C%22date%22%3A%222012%22%2C%22proceedingsTitle%22%3A%22Proceedings%20of%20the%20SIGCHI%20Conference%20on%20Human%20Factors%20in%20Computing%20Systems%22%2C%22conferenceName%22%3A%22%22%2C%22language%22%3A%22en%22%2C%22DOI%22%3A%2210.1145%5C%2F2207676.2207738%22%2C%22ISBN%22%3A%22978-1-4503-1015-4%22%2C%22url%22%3A%22http%3A%5C%2F%5C%2Fdoi.acm.org%5C%2F10.1145%5C%2F2207676.2207738%22%2C%22collections%22%3A%5B%5D%2C%22dateModified%22%3A%222019-07-27T21%3A42%3A45Z%22%2C%22tags%22%3A%5B%7B%22tag%22%3A%22Topic%20clusters%22%7D%2C%7B%22tag%22%3A%22Topic%20model%20interpretation%22%7D%2C%7B%22tag%22%3A%22Topic%20model%20optimization%22%7D%2C%7B%22tag%22%3A%22Topic%20model%20visualization%22%7D%2C%7B%22tag%22%3A%22Topic%20modeling%22%7D%5D%7D%7D%2C%7B%22key%22%3A%22PKJHFGG8%22%2C%22library%22%3A%7B%22id%22%3A2133649%7D%2C%22meta%22%3A%7B%22lastModifiedByUser%22%3A%7B%22id%22%3A22837%2C%22username%22%3A%22ayliu%22%2C%22name%22%3A%22Alan%20Liu%22%2C%22links%22%3A%7B%22alternate%22%3A%7B%22href%22%3A%22https%3A%5C%2F%5C%2Fwww.zotero.org%5C%2Fayliu%22%2C%22type%22%3A%22text%5C%2Fhtml%22%7D%7D%7D%2C%22creatorSummary%22%3A%22Ponweiser%22%2C%22parsedDate%22%3A%222012%22%2C%22numChildren%22%3A0%7D%2C%22bib%22%3A%22%26lt%3Bdiv%20class%3D%26quot%3Bcsl-bib-body%26quot%3B%20style%3D%26quot%3Bline-height%3A%201.35%3B%20padding-left%3A%201em%3B%20text-indent%3A-1em%3B%26quot%3B%26gt%3B%5Cn%20%20%26lt%3Bdiv%20class%3D%26quot%3Bcsl-entry%26quot%3B%26gt%3BPonweiser%2C%20Martin.%20%26%23x201C%3BLatent%20Dirichlet%20Allocation%20in%20R.%26%23x201D%3B%20Diploma%20Thesis%2C%20Vienna%20University%20of%20Economics%20and%20Business%2C%202012.%20%26lt%3Ba%20class%3D%26%23039%3Bzp-ItemURL%26%23039%3B%20href%3D%26%23039%3Bhttp%3A%5C%2F%5C%2Fepub.wu.ac.at%5C%2F3558%5C%2F%26%23039%3B%26gt%3Bhttp%3A%5C%2F%5C%2Fepub.wu.ac.at%5C%2F3558%5C%2F%26lt%3B%5C%2Fa%26gt%3B.%20%26lt%3Ba%20title%3D%26%23039%3BCite%20in%20RIS%20Format%26%23039%3B%20class%3D%26%23039%3Bzp-CiteRIS%26%23039%3B%20data-zp-cite%3D%26%23039%3Bapi_user_id%3D2133649%26amp%3Bitem_key%3DPKJHFGG8%26%23039%3B%20href%3D%26%23039%3Bjavascript%3Avoid%280%29%3B%26%23039%3B%26gt%3BCite%26lt%3B%5C%2Fa%26gt%3B%20%26lt%3B%5C%2Fdiv%26gt%3B%5Cn%26lt%3B%5C%2Fdiv%26gt%3B%22%2C%22data%22%3A%7B%22itemType%22%3A%22thesis%22%2C%22title%22%3A%22Latent%20Dirichlet%20Allocation%20in%20R%22%2C%22creators%22%3A%5B%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Martin%22%2C%22lastName%22%3A%22Ponweiser%22%7D%5D%2C%22abstractNote%22%3A%22This%20thesis%20focuses%20on%20LDA%26%23039%3Bs%20practical%20application.%20Its%20main%20goal%20is%20the%20replication%20of%20the%20data%20analyses%20from%20the%202004%20LDA%20paper%20%5Cu201cFinding%20scientific%20topics%5Cu201d%20by%20Thomas%20Griffiths%20and%20Mark%20Steyvers%20within%20the%20framework%20of%20the%20R%20statistical%20programming%20language%20and%20the%20R%5C%5Ctextasciitildepackage%20topic%20models%20by%20Bettina%20Gr%5Cu00c3%5Cu00bcn%20and%20Kurt%20Hornik.%20The%20complete%20process%2C%20including%20extraction%20of%20a%20text%20corpus%20from%20the%20PNAS%20journal%26%23039%3Bs%20website%2C%20data%20preprocessing%2C%20transformation%20into%20a%20document-term%20matrix%2C%20model%20selection%2C%20model%20estimation%2C%20as%20well%20as%20presentation%20of%20the%20results%2C%20is%20fully%20documented%20and%20commented.%20The%20outcome%20closely%20matches%20the%20analyses%20of%20the%20original%20paper%2C%20therefore%20the%20research%20by%20Griffiths%5C%2FSteyvers%20can%20be%20reproduced.%20Furthermore%2C%20this%20thesis%20proves%20the%20suitability%20of%20the%20R%20environment%20for%20text%20mining%20with%20LDA.%22%2C%22thesisType%22%3A%22Diploma%20Thesis%22%2C%22university%22%3A%22Vienna%20University%20of%20Economics%20and%20Business%22%2C%22date%22%3A%222012%22%2C%22language%22%3A%22en%22%2C%22url%22%3A%22http%3A%5C%2F%5C%2Fepub.wu.ac.at%5C%2F3558%5C%2F%22%2C%22collections%22%3A%5B%5D%2C%22dateModified%22%3A%222019-07-27T21%3A43%3A13Z%22%2C%22tags%22%3A%5B%7B%22tag%22%3A%22Topic%20model%20algorithm%22%7D%2C%7B%22tag%22%3A%22Topic%20model%20optimization%22%7D%2C%7B%22tag%22%3A%22Topic%20modeling%22%7D%5D%7D%7D%2C%7B%22key%22%3A%22KRM8NHH8%22%2C%22library%22%3A%7B%22id%22%3A2133649%7D%2C%22meta%22%3A%7B%22lastModifiedByUser%22%3A%7B%22id%22%3A22837%2C%22username%22%3A%22ayliu%22%2C%22name%22%3A%22Alan%20Liu%22%2C%22links%22%3A%7B%22alternate%22%3A%7B%22href%22%3A%22https%3A%5C%2F%5C%2Fwww.zotero.org%5C%2Fayliu%22%2C%22type%22%3A%22text%5C%2Fhtml%22%7D%7D%7D%2C%22creatorSummary%22%3A%22Matt%20Taddy%22%2C%22parsedDate%22%3A%222012%22%2C%22numChildren%22%3A1%7D%2C%22bib%22%3A%22%26lt%3Bdiv%20class%3D%26quot%3Bcsl-bib-body%26quot%3B%20style%3D%26quot%3Bline-height%3A%201.35%3B%20padding-left%3A%201em%3B%20text-indent%3A-1em%3B%26quot%3B%26gt%3B%5Cn%20%20%26lt%3Bdiv%20class%3D%26quot%3Bcsl-entry%26quot%3B%26gt%3BMatt%20Taddy.%20%26%23x201C%3BOn%20Estimation%20and%20Selection%20for%20Topic%20Models.%26%23x201D%3B%20In%20%26lt%3Bi%26gt%3BProceedings%20of%20the%20Fifteenth%20International%20Conference%20on%20Artificial%20Intelligence%20and%20Statistics%26lt%3B%5C%2Fi%26gt%3B%2C%20edited%20by%20Neil%20D.%20Lawrence%20and%20Mark%20Girolami%2C%201184%26%23x2013%3B93.%20PMLR%2C%202012.%20%26lt%3Ba%20class%3D%26%23039%3Bzp-ItemURL%26%23039%3B%20href%3D%26%23039%3Bhttp%3A%5C%2F%5C%2Fproceedings.mlr.press%5C%2Fv22%5C%2Ftaddy12.html%26%23039%3B%26gt%3Bhttp%3A%5C%2F%5C%2Fproceedings.mlr.press%5C%2Fv22%5C%2Ftaddy12.html%26lt%3B%5C%2Fa%26gt%3B.%20%26lt%3Ba%20title%3D%26%23039%3BCite%20in%20RIS%20Format%26%23039%3B%20class%3D%26%23039%3Bzp-CiteRIS%26%23039%3B%20data-zp-cite%3D%26%23039%3Bapi_user_id%3D2133649%26amp%3Bitem_key%3DKRM8NHH8%26%23039%3B%20href%3D%26%23039%3Bjavascript%3Avoid%280%29%3B%26%23039%3B%26gt%3BCite%26lt%3B%5C%2Fa%26gt%3B%20%26lt%3B%5C%2Fdiv%26gt%3B%5Cn%26lt%3B%5C%2Fdiv%26gt%3B%22%2C%22data%22%3A%7B%22itemType%22%3A%22conferencePaper%22%2C%22title%22%3A%22On%20Estimation%20and%20Selection%20for%20Topic%20Models%22%2C%22creators%22%3A%5B%7B%22creatorType%22%3A%22author%22%2C%22name%22%3A%22Matt%20Taddy%22%7D%2C%7B%22creatorType%22%3A%22editor%22%2C%22name%22%3A%22Neil%20D.%20Lawrence%22%7D%2C%7B%22creatorType%22%3A%22editor%22%2C%22name%22%3A%22Mark%20Girolami%22%7D%5D%2C%22abstractNote%22%3A%22This%20article%20describes%20posterior%20maximization%20for%20topic%20models%2C%20identifying%20computational%20and%20conceptual%20gains%20from%20inference%20under%20a%20non-standard%20parametrization.%20Then%20shows%20that%20fitted%20parameters%20can%20be%20used%20as%20the%20basis%20for%20a%20novel%20approach%20to%20marginal%20likelihood%20estimation%2C%20via%20block-diagonal%20approximation%20to%20the%20information%20matrix%2C%20that%20facilitates%20choosing%20the%20number%20of%20latent%20topics.%20This%20likelihood-based%20model%20selection%20is%20complemented%20with%20a%20goodness-of-fit%20analysis%20built%20around%20estimated%20residual%20dispersion.%20Examples%20are%20provided%20to%20illustrate%20model%20selection%20as%20well%20as%20to%20compare%20our%20estimation%20against%20standard%20alternative%20techniques.%22%2C%22date%22%3A%222012%22%2C%22proceedingsTitle%22%3A%22Proceedings%20of%20the%20Fifteenth%20International%20Conference%20on%20Artificial%20Intelligence%20and%20Statistics%22%2C%22conferenceName%22%3A%22%22%2C%22language%22%3A%22en%22%2C%22DOI%22%3A%22%22%2C%22ISBN%22%3A%22%22%2C%22url%22%3A%22http%3A%5C%2F%5C%2Fproceedings.mlr.press%5C%2Fv22%5C%2Ftaddy12.html%22%2C%22collections%22%3A%5B%5D%2C%22dateModified%22%3A%222019-07-27T21%3A44%3A24Z%22%2C%22tags%22%3A%5B%7B%22tag%22%3A%22Topic%20model%20optimization%22%7D%2C%7B%22tag%22%3A%22Topic%20modeling%22%7D%5D%7D%7D%2C%7B%22key%22%3A%22KAIJ9C5V%22%2C%22library%22%3A%7B%22id%22%3A2133649%7D%2C%22meta%22%3A%7B%22lastModifiedByUser%22%3A%7B%22id%22%3A22837%2C%22username%22%3A%22ayliu%22%2C%22name%22%3A%22Alan%20Liu%22%2C%22links%22%3A%7B%22alternate%22%3A%7B%22href%22%3A%22https%3A%5C%2F%5C%2Fwww.zotero.org%5C%2Fayliu%22%2C%22type%22%3A%22text%5C%2Fhtml%22%7D%7D%7D%2C%22creatorSummary%22%3A%22Bischof%20and%20Airoldi%22%2C%22parsedDate%22%3A%222012%22%2C%22numChildren%22%3A0%7D%2C%22bib%22%3A%22%26lt%3Bdiv%20class%3D%26quot%3Bcsl-bib-body%26quot%3B%20style%3D%26quot%3Bline-height%3A%201.35%3B%20padding-left%3A%201em%3B%20text-indent%3A-1em%3B%26quot%3B%26gt%3B%5Cn%20%20%26lt%3Bdiv%20class%3D%26quot%3Bcsl-entry%26quot%3B%26gt%3BBischof%2C%20Jonathan%20M.%2C%20and%20Edoardo%20M.%20Airoldi.%20%26%23x201C%3BSummarizing%20Topical%20Content%20with%20Word%20Frequency%20and%20Exclusivity.%26%23x201D%3B%20In%20%26lt%3Bi%26gt%3BProceedings%20of%20the%2029th%20International%20Coference%20on%20International%20Conference%20on%20Machine%20Learning%26lt%3B%5C%2Fi%26gt%3B%2C%209%26%23x2013%3B16.%20ICML%26%23x2019%3B12.%20USA%3A%20Omnipress%2C%202012.%20%26lt%3Ba%20class%3D%26%23039%3Bzp-ItemURL%26%23039%3B%20href%3D%26%23039%3Bhttps%3A%5C%2F%5C%2Ficml.cc%5C%2F2012%5C%2Fpapers%5C%2F113.pdf%26%23039%3B%26gt%3Bhttps%3A%5C%2F%5C%2Ficml.cc%5C%2F2012%5C%2Fpapers%5C%2F113.pdf%26lt%3B%5C%2Fa%26gt%3B.%20%26lt%3Ba%20title%3D%26%23039%3BCite%20in%20RIS%20Format%26%23039%3B%20class%3D%26%23039%3Bzp-CiteRIS%26%23039%3B%20data-zp-cite%3D%26%23039%3Bapi_user_id%3D2133649%26amp%3Bitem_key%3DKAIJ9C5V%26%23039%3B%20href%3D%26%23039%3Bjavascript%3Avoid%280%29%3B%26%23039%3B%26gt%3BCite%26lt%3B%5C%2Fa%26gt%3B%20%26lt%3B%5C%2Fdiv%26gt%3B%5Cn%26lt%3B%5C%2Fdiv%26gt%3B%22%2C%22data%22%3A%7B%22itemType%22%3A%22conferencePaper%22%2C%22title%22%3A%22Summarizing%20Topical%20Content%20with%20Word%20Frequency%20and%20Exclusivity%22%2C%22creators%22%3A%5B%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Jonathan%20M.%22%2C%22lastName%22%3A%22Bischof%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Edoardo%20M.%22%2C%22lastName%22%3A%22Airoldi%22%7D%5D%2C%22abstractNote%22%3A%22Hierarchical%20Poisson%20Convolution%20%28HPC%29%2C%20a%20model%20which%20infers%20regularized%20estimates%20of%20the%20differential%20use%20of%20words%20across%20topics%20as%20well%20as%20their%20frequency%20within%20topics.%20HPC%20uses%20known%20hierarchical%20structure%20on%20human-labeled%20topics%20to%20make%20focused%20comparisons%20of%20differential%20usage%20within%20each%20branch%20of%20the%20hierarchy%20of%20labels.%20This%20work%20provides%20a%20summary%20for%20each%20topic%20in%20terms%20of%20words%20that%20are%20both%20frequent%20and%20exclusive.%22%2C%22date%22%3A%222012%22%2C%22proceedingsTitle%22%3A%22Proceedings%20of%20the%2029th%20International%20Coference%20on%20International%20Conference%20on%20Machine%20Learning%22%2C%22conferenceName%22%3A%22%22%2C%22language%22%3A%22en%22%2C%22DOI%22%3A%22%22%2C%22ISBN%22%3A%22978-1-4503-1285-1%22%2C%22url%22%3A%22https%3A%5C%2F%5C%2Ficml.cc%5C%2F2012%5C%2Fpapers%5C%2F113.pdf%22%2C%22collections%22%3A%5B%5D%2C%22dateModified%22%3A%222019-07-27T21%3A45%3A38Z%22%2C%22tags%22%3A%5B%7B%22tag%22%3A%22Topic%20model%20optimization%22%7D%2C%7B%22tag%22%3A%22Topic%20modeling%22%7D%5D%7D%7D%2C%7B%22key%22%3A%227RCXUJRT%22%2C%22library%22%3A%7B%22id%22%3A2133649%7D%2C%22meta%22%3A%7B%22lastModifiedByUser%22%3A%7B%22id%22%3A22837%2C%22username%22%3A%22ayliu%22%2C%22name%22%3A%22Alan%20Liu%22%2C%22links%22%3A%7B%22alternate%22%3A%7B%22href%22%3A%22https%3A%5C%2F%5C%2Fwww.zotero.org%5C%2Fayliu%22%2C%22type%22%3A%22text%5C%2Fhtml%22%7D%7D%7D%2C%22creatorSummary%22%3A%22Mimno%20and%20Blei%22%2C%22parsedDate%22%3A%222011%22%2C%22numChildren%22%3A0%7D%2C%22bib%22%3A%22%26lt%3Bdiv%20class%3D%26quot%3Bcsl-bib-body%26quot%3B%20style%3D%26quot%3Bline-height%3A%201.35%3B%20padding-left%3A%201em%3B%20text-indent%3A-1em%3B%26quot%3B%26gt%3B%5Cn%20%20%26lt%3Bdiv%20class%3D%26quot%3Bcsl-entry%26quot%3B%26gt%3BMimno%2C%20David%2C%20and%20David%20Blei.%20%26%23x201C%3BBayesian%20Checking%20for%20Topic%20Models.%26%23x201D%3B%20In%20%26lt%3Bi%26gt%3BProceedings%20of%20the%20Conference%20on%20Empirical%20Methods%20in%20Natural%20Language%20Processing%26lt%3B%5C%2Fi%26gt%3B%2C%20227%26%23x2013%3B37.%20Association%20for%20Computational%20Linguistics%2C%202011.%20%26lt%3Ba%20title%3D%26%23039%3BCite%20in%20RIS%20Format%26%23039%3B%20class%3D%26%23039%3Bzp-CiteRIS%26%23039%3B%20data-zp-cite%3D%26%23039%3Bapi_user_id%3D2133649%26amp%3Bitem_key%3D7RCXUJRT%26%23039%3B%20href%3D%26%23039%3Bjavascript%3Avoid%280%29%3B%26%23039%3B%26gt%3BCite%26lt%3B%5C%2Fa%26gt%3B%20%26lt%3B%5C%2Fdiv%26gt%3B%5Cn%26lt%3B%5C%2Fdiv%26gt%3B%22%2C%22data%22%3A%7B%22itemType%22%3A%22conferencePaper%22%2C%22title%22%3A%22Bayesian%20checking%20for%20topic%20models%22%2C%22creators%22%3A%5B%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22David%22%2C%22lastName%22%3A%22Mimno%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22David%22%2C%22lastName%22%3A%22Blei%22%7D%5D%2C%22abstractNote%22%3A%22Real%20document%20collections%20do%20not%20fit%20the%20independence%20assumptions%20asserted%20by%20most%20statistical%20topic%20models%2C%20but%20how%20badly%20do%20they%20violate%20them%3F%20This%20paper%20presents%20a%20Bayesian%20method%20for%20measuring%20how%20well%20a%20topic%20model%20fits%20a%20corpus.%20The%20approach%20is%20based%20on%20posterior%20predictive%20checking%2C%20a%20method%20for%20diagnosing%20Bayesian%20models%20in%20user-defined%20ways.%20The%20method%20can%20identify%20where%20a%20topic%20model%20fits%20the%20data%2C%20where%20it%20falls%20short%2C%20and%20in%20which%20directions%20it%20might%20be%20improved.%22%2C%22date%22%3A%222011%22%2C%22proceedingsTitle%22%3A%22Proceedings%20of%20the%20conference%20on%20empirical%20methods%20in%20natural%20language%20processing%22%2C%22conferenceName%22%3A%22%22%2C%22language%22%3A%22en%22%2C%22DOI%22%3A%22%22%2C%22ISBN%22%3A%22%22%2C%22url%22%3A%22%22%2C%22collections%22%3A%5B%5D%2C%22dateModified%22%3A%222020-07-01T20%3A57%3A38Z%22%2C%22tags%22%3A%5B%7B%22tag%22%3A%22Natural%20language%20processing%22%7D%2C%7B%22tag%22%3A%22Topic%20model%20optimization%22%7D%2C%7B%22tag%22%3A%22Topic%20modeling%22%7D%5D%7D%7D%2C%7B%22key%22%3A%22KRD7YKRH%22%2C%22library%22%3A%7B%22id%22%3A2133649%7D%2C%22meta%22%3A%7B%22lastModifiedByUser%22%3A%7B%22id%22%3A1550555%2C%22username%22%3A%22nazkey%22%2C%22name%22%3A%22Naz%20Keynejad%22%2C%22links%22%3A%7B%22alternate%22%3A%7B%22href%22%3A%22https%3A%5C%2F%5C%2Fwww.zotero.org%5C%2Fnazkey%22%2C%22type%22%3A%22text%5C%2Fhtml%22%7D%7D%7D%2C%22creatorSummary%22%3A%22Gavagai%20%28company%29%22%2C%22parsedDate%22%3A%222011%22%2C%22numChildren%22%3A0%7D%2C%22bib%22%3A%22%26lt%3Bdiv%20class%3D%26quot%3Bcsl-bib-body%26quot%3B%20style%3D%26quot%3Bline-height%3A%201.35%3B%20padding-left%3A%201em%3B%20text-indent%3A-1em%3B%26quot%3B%26gt%3B%5Cn%20%20%26lt%3Bdiv%20class%3D%26quot%3Bcsl-entry%26quot%3B%26gt%3BGavagai%20%28company%29.%20%26%23x201C%3BThe%20Advantage%20of%20Ethersource%20on%20the%20TOEFL%20Synonym%20Test%20Compared%20to%20Other%20Methods.%26%23x201D%3B%20%26lt%3Bi%26gt%3BGavagai%26lt%3B%5C%2Fi%26gt%3B%20%28blog%29%2C%202011.%20%26lt%3Ba%20class%3D%26%23039%3Bzp-ItemURL%26%23039%3B%20href%3D%26%23039%3Bhttps%3A%5C%2F%5C%2Fwww.gavagai.io%5C%2Fblog%5C%2F2011%5C%2F12%5C%2F14%5C%2Fthe-advantage-of-ethersource-on-the-toefl-synonym-test-compared-to-other-methods%5C%2F%26%23039%3B%26gt%3Bhttps%3A%5C%2F%5C%2Fwww.gavagai.io%5C%2Fblog%5C%2F2011%5C%2F12%5C%2F14%5C%2Fthe-advantage-of-ethersource-on-the-toefl-synonym-test-compared-to-other-methods%5C%2F%26lt%3B%5C%2Fa%26gt%3B.%20%26lt%3Ba%20title%3D%26%23039%3BCite%20in%20RIS%20Format%26%23039%3B%20class%3D%26%23039%3Bzp-CiteRIS%26%23039%3B%20data-zp-cite%3D%26%23039%3Bapi_user_id%3D2133649%26amp%3Bitem_key%3DKRD7YKRH%26%23039%3B%20href%3D%26%23039%3Bjavascript%3Avoid%280%29%3B%26%23039%3B%26gt%3BCite%26lt%3B%5C%2Fa%26gt%3B%20%26lt%3B%5C%2Fdiv%26gt%3B%5Cn%26lt%3B%5C%2Fdiv%26gt%3B%22%2C%22data%22%3A%7B%22itemType%22%3A%22blogPost%22%2C%22title%22%3A%22The%20Advantage%20of%20Ethersource%20on%20the%20TOEFL%20Synonym%20Test%20Compared%20to%20other%20Methods%22%2C%22creators%22%3A%5B%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22%22%2C%22lastName%22%3A%22Gavagai%20%28company%29%22%7D%5D%2C%22abstractNote%22%3A%22This%20post%20compares%20the%20performance%20of%20various%20semantic%20algorithms%20Ethersource%20solves%20a%20synonym%20test%20with%2062%25%20correct%20answers%2C%20while%20the%20best%20runner-up%20only%20reaches%2052%25%20The%20results%20demonstrate%20the%20advantage%20of%20Ethersource%20over%20other%20relevant%20methods%20As%20part%20of%20our%20internal%20system%20performance%20monitoring%2C%20we%20continuously%20evaluate%20Ethersource%20using%20a%20number%20of%20standardized%20benchmark%20tests.%20%5Cu2026%22%2C%22blogTitle%22%3A%22Gavagai%22%2C%22date%22%3A%222011%22%2C%22url%22%3A%22https%3A%5C%2F%5C%2Fwww.gavagai.io%5C%2Fblog%5C%2F2011%5C%2F12%5C%2F14%5C%2Fthe-advantage-of-ethersource-on-the-toefl-synonym-test-compared-to-other-methods%5C%2F%22%2C%22language%22%3A%22en%22%2C%22collections%22%3A%5B%5D%2C%22dateModified%22%3A%222020-07-01T20%3A54%3A57Z%22%2C%22tags%22%3A%5B%7B%22tag%22%3A%22Topic%20model%20optimization%22%7D%5D%7D%7D%2C%7B%22key%22%3A%225RKUY2I6%22%2C%22library%22%3A%7B%22id%22%3A2133649%7D%2C%22meta%22%3A%7B%22creatorSummary%22%3A%22Ratinov%20et%20al.%22%2C%22parsedDate%22%3A%222011%22%2C%22numChildren%22%3A0%7D%2C%22bib%22%3A%22%26lt%3Bdiv%20class%3D%26quot%3Bcsl-bib-body%26quot%3B%20style%3D%26quot%3Bline-height%3A%201.35%3B%20padding-left%3A%201em%3B%20text-indent%3A-1em%3B%26quot%3B%26gt%3B%5Cn%20%20%26lt%3Bdiv%20class%3D%26quot%3Bcsl-entry%26quot%3B%26gt%3BRatinov%2C%20Lev%2C%20Dan%20Roth%2C%20Doug%20Downey%2C%20and%20Mike%20Anderson.%20%26%23x201C%3BLocal%20and%20Global%20Algorithms%20for%20Disambiguation%20to%20Wikipedia%2C%26%23x201D%3B%201375%26%23x2013%3B84%2C%202011.%20%26lt%3Ba%20class%3D%26%23039%3Bzp-ItemURL%26%23039%3B%20href%3D%26%23039%3Bhttps%3A%5C%2F%5C%2Faclweb.org%5C%2Fanthology%5C%2Fpapers%5C%2FP%5C%2FP11%5C%2FP11-1138%5C%2F%26%23039%3B%26gt%3Bhttps%3A%5C%2F%5C%2Faclweb.org%5C%2Fanthology%5C%2Fpapers%5C%2FP%5C%2FP11%5C%2FP11-1138%5C%2F%26lt%3B%5C%2Fa%26gt%3B.%20%26lt%3Ba%20title%3D%26%23039%3BCite%20in%20RIS%20Format%26%23039%3B%20class%3D%26%23039%3Bzp-CiteRIS%26%23039%3B%20data-zp-cite%3D%26%23039%3Bapi_user_id%3D2133649%26amp%3Bitem_key%3D5RKUY2I6%26%23039%3B%20href%3D%26%23039%3Bjavascript%3Avoid%280%29%3B%26%23039%3B%26gt%3BCite%26lt%3B%5C%2Fa%26gt%3B%20%26lt%3B%5C%2Fdiv%26gt%3B%5Cn%26lt%3B%5C%2Fdiv%26gt%3B%22%2C%22data%22%3A%7B%22itemType%22%3A%22conferencePaper%22%2C%22title%22%3A%22Local%20and%20Global%20Algorithms%20for%20Disambiguation%20to%20Wikipedia%22%2C%22creators%22%3A%5B%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Lev%22%2C%22lastName%22%3A%22Ratinov%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Dan%22%2C%22lastName%22%3A%22Roth%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Doug%22%2C%22lastName%22%3A%22Downey%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Mike%22%2C%22lastName%22%3A%22Anderson%22%7D%5D%2C%22abstractNote%22%3A%22Disambiguating%20concepts%20and%20entities%20in%20a%20context%20sensitive%20way%20is%20a%20fundamental%20problem%20in%20natural%20language%20processing.%20The%20comprehensiveness%20of%20Wikipedia%20has%20made%20the%20online%20encyclopedia%20an%20increasingly%20popular%20target%20for%20disambiguation.%20Disambiguation%20to%20Wikipedia%20is%20similar%20to%20a%20traditional%20Word%20Sense%20Disambiguation%20task%2C%20but%20distinct%20in%20that%20the%20Wikipedia%20link%20structure%20provides%20additional%20information%20about%20which%20disambiguations%20are%20compatible.%20In%20this%20work%20the%20authors%20analyze%20approaches%20that%20utilize%20this%20information%20to%20arrive%20at%20coherent%20sets%20of%20disambiguations%20for%20a%20given%20document%20%28which%20we%20call%20%5Cu201cglobal%5Cu201d%20approaches%29%2C%20and%20compare%20them%20to%20more%20traditional%20%28local%29%20approaches.%20They%20show%20that%20previous%20approaches%20for%20global%20disambiguation%20can%20be%20improved%2C%20but%20even%20then%20the%20local%20disambiguation%20provides%20a%20baseline%20which%20is%20very%20hard%20to%20beat.%22%2C%22date%22%3A%222011%22%2C%22proceedingsTitle%22%3A%22%22%2C%22conferenceName%22%3A%22%22%2C%22language%22%3A%22en%22%2C%22DOI%22%3A%22%22%2C%22ISBN%22%3A%22%22%2C%22url%22%3A%22https%3A%5C%2F%5C%2Faclweb.org%5C%2Fanthology%5C%2Fpapers%5C%2FP%5C%2FP11%5C%2FP11-1138%5C%2F%22%2C%22collections%22%3A%5B%5D%2C%22dateModified%22%3A%222019-07-27T21%3A43%3A25Z%22%2C%22tags%22%3A%5B%7B%22tag%22%3A%22Natural%20language%20processing%22%7D%2C%7B%22tag%22%3A%22Topic%20model%20optimization%22%7D%2C%7B%22tag%22%3A%22Wikification%22%7D%5D%7D%7D%2C%7B%22key%22%3A%22FJ8B6MYM%22%2C%22library%22%3A%7B%22id%22%3A2133649%7D%2C%22meta%22%3A%7B%22lastModifiedByUser%22%3A%7B%22id%22%3A22837%2C%22username%22%3A%22ayliu%22%2C%22name%22%3A%22Alan%20Liu%22%2C%22links%22%3A%7B%22alternate%22%3A%7B%22href%22%3A%22https%3A%5C%2F%5C%2Fwww.zotero.org%5C%2Fayliu%22%2C%22type%22%3A%22text%5C%2Fhtml%22%7D%7D%7D%2C%22creatorSummary%22%3A%22Arun%20et%20al.%22%2C%22parsedDate%22%3A%222010%22%2C%22numChildren%22%3A0%7D%2C%22bib%22%3A%22%26lt%3Bdiv%20class%3D%26quot%3Bcsl-bib-body%26quot%3B%20style%3D%26quot%3Bline-height%3A%201.35%3B%20padding-left%3A%201em%3B%20text-indent%3A-1em%3B%26quot%3B%26gt%3B%5Cn%20%20%26lt%3Bdiv%20class%3D%26quot%3Bcsl-entry%26quot%3B%26gt%3BArun%2C%20R.%2C%20V.%20Suresh%2C%20C.%20E.%20Veni%20Madhavan%2C%20and%20M.%20N.%20Narasimha%20Murthy.%20%26%23x201C%3BOn%20Finding%20the%20Natural%20Number%20of%20Topics%20with%20Latent%20Dirichlet%20Allocation%3A%20Some%20Observations.%26%23x201D%3B%20In%20%26lt%3Bi%26gt%3BAdvances%20in%20Knowledge%20Discovery%20and%20Data%20Mining%26lt%3B%5C%2Fi%26gt%3B%2C%20edited%20by%20Mohammed%20J.%20Zaki%2C%20Jeffrey%20Xu%20Yu%2C%20B.%20Ravindran%2C%20and%20Vikram%20Pudi%2C%20391%26%23x2013%3B402.%20Lecture%20Notes%20in%20Computer%20Science.%20Springer%20Berlin%20Heidelberg%2C%202010.%20%26lt%3Ba%20title%3D%26%23039%3BCite%20in%20RIS%20Format%26%23039%3B%20class%3D%26%23039%3Bzp-CiteRIS%26%23039%3B%20data-zp-cite%3D%26%23039%3Bapi_user_id%3D2133649%26amp%3Bitem_key%3DFJ8B6MYM%26%23039%3B%20href%3D%26%23039%3Bjavascript%3Avoid%280%29%3B%26%23039%3B%26gt%3BCite%26lt%3B%5C%2Fa%26gt%3B%20%26lt%3B%5C%2Fdiv%26gt%3B%5Cn%26lt%3B%5C%2Fdiv%26gt%3B%22%2C%22data%22%3A%7B%22itemType%22%3A%22conferencePaper%22%2C%22title%22%3A%22On%20Finding%20the%20Natural%20Number%20of%20Topics%20with%20Latent%20Dirichlet%20Allocation%3A%20Some%20Observations%22%2C%22creators%22%3A%5B%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22R.%22%2C%22lastName%22%3A%22Arun%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22V.%22%2C%22lastName%22%3A%22Suresh%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22C.%20E.%22%2C%22lastName%22%3A%22Veni%20Madhavan%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22M.%20N.%22%2C%22lastName%22%3A%22Narasimha%20Murthy%22%7D%2C%7B%22creatorType%22%3A%22editor%22%2C%22firstName%22%3A%22Mohammed%20J.%22%2C%22lastName%22%3A%22Zaki%22%7D%2C%7B%22creatorType%22%3A%22editor%22%2C%22firstName%22%3A%22Jeffrey%20Xu%22%2C%22lastName%22%3A%22Yu%22%7D%2C%7B%22creatorType%22%3A%22editor%22%2C%22firstName%22%3A%22B.%22%2C%22lastName%22%3A%22Ravindran%22%7D%2C%7B%22creatorType%22%3A%22editor%22%2C%22firstName%22%3A%22Vikram%22%2C%22lastName%22%3A%22Pudi%22%7D%5D%2C%22abstractNote%22%3A%22This%20work%20proposes%20a%20measure%20to%20identify%20the%20correct%20number%20of%20topics%20and%20offer%20empirical%20evidence%20in%20its%20favor%20in%20terms%20of%20classification%20accuracy%20and%20the%20number%20of%20topics%20that%20are%20naturally%20present%20in%20the%20corpus.%20The%20measure%26%23039%3Bs%20merit%20is%20shown%20by%20applying%20it%20on%20real-world%20as%20well%20as%20synthetic%20data%20sets%28both%20text%20and%20images%29.%20In%20proposing%20this%20measure%2C%20view%20LDA%20as%20a%20matrix%20factorization%20mechanism%2C%20wherein%20a%20given%20corpus%20C%20is%20split%20into%20two%20matrix%20factors%20M%201%20and%20M%202%20as%20given%20by%20C%20d%2Aw%20%3D%20M1%20d%2At%20x%20Q%20t%2Aw%20.%22%2C%22date%22%3A%222010%22%2C%22proceedingsTitle%22%3A%22Advances%20in%20Knowledge%20Discovery%20and%20Data%20Mining%22%2C%22conferenceName%22%3A%22%22%2C%22language%22%3A%22en%22%2C%22DOI%22%3A%22%22%2C%22ISBN%22%3A%22978-3-642-13657-3%22%2C%22url%22%3A%22%22%2C%22collections%22%3A%5B%5D%2C%22dateModified%22%3A%222019-07-27T21%3A44%3A27Z%22%2C%22tags%22%3A%5B%7B%22tag%22%3A%22Topic%20model%20optimization%22%7D%2C%7B%22tag%22%3A%22Topic%20modeling%22%7D%5D%7D%7D%2C%7B%22key%22%3A%22WWSATTZU%22%2C%22library%22%3A%7B%22id%22%3A2133649%7D%2C%22meta%22%3A%7B%22lastModifiedByUser%22%3A%7B%22id%22%3A22837%2C%22username%22%3A%22ayliu%22%2C%22name%22%3A%22Alan%20Liu%22%2C%22links%22%3A%7B%22alternate%22%3A%7B%22href%22%3A%22https%3A%5C%2F%5C%2Fwww.zotero.org%5C%2Fayliu%22%2C%22type%22%3A%22text%5C%2Fhtml%22%7D%7D%7D%2C%22creatorSummary%22%3A%22Cao%20et%20al.%22%2C%22parsedDate%22%3A%222009%22%2C%22numChildren%22%3A0%7D%2C%22bib%22%3A%22%26lt%3Bdiv%20class%3D%26quot%3Bcsl-bib-body%26quot%3B%20style%3D%26quot%3Bline-height%3A%201.35%3B%20padding-left%3A%201em%3B%20text-indent%3A-1em%3B%26quot%3B%26gt%3B%5Cn%20%20%26lt%3Bdiv%20class%3D%26quot%3Bcsl-entry%26quot%3B%26gt%3BCao%2C%20Juan%2C%20Tian%20Xia%2C%20Jintao%20Li%2C%20Yongdong%20Zhang%2C%20and%20Sheng%20Tang.%20%26%23x201C%3BA%20Density-Based%20Method%20for%20Adaptive%20LDA%20Model%20Selection.%26%23x201D%3B%20%26lt%3Bi%26gt%3BNeurocomputing%26lt%3B%5C%2Fi%26gt%3B%2C%20Advances%20in%20Machine%20Learning%20and%20Computational%20Intelligence%2C%2072%2C%20no.%207%20%282009%29%3A%201775%26%23x2013%3B81.%20%26lt%3Ba%20class%3D%26%23039%3Bzp-DOIURL%26%23039%3B%20href%3D%26%23039%3Bhttps%3A%5C%2F%5C%2Fdoi.org%5C%2F10.1016%5C%2Fj.neucom.2008.06.011%26%23039%3B%26gt%3Bhttps%3A%5C%2F%5C%2Fdoi.org%5C%2F10.1016%5C%2Fj.neucom.2008.06.011%26lt%3B%5C%2Fa%26gt%3B.%20%26lt%3Ba%20title%3D%26%23039%3BCite%20in%20RIS%20Format%26%23039%3B%20class%3D%26%23039%3Bzp-CiteRIS%26%23039%3B%20data-zp-cite%3D%26%23039%3Bapi_user_id%3D2133649%26amp%3Bitem_key%3DWWSATTZU%26%23039%3B%20href%3D%26%23039%3Bjavascript%3Avoid%280%29%3B%26%23039%3B%26gt%3BCite%26lt%3B%5C%2Fa%26gt%3B%20%26lt%3B%5C%2Fdiv%26gt%3B%5Cn%26lt%3B%5C%2Fdiv%26gt%3B%22%2C%22data%22%3A%7B%22itemType%22%3A%22journalArticle%22%2C%22title%22%3A%22A%20density-based%20method%20for%20adaptive%20LDA%20model%20selection%22%2C%22creators%22%3A%5B%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Juan%22%2C%22lastName%22%3A%22Cao%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Tian%22%2C%22lastName%22%3A%22Xia%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Jintao%22%2C%22lastName%22%3A%22Li%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Yongdong%22%2C%22lastName%22%3A%22Zhang%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Sheng%22%2C%22lastName%22%3A%22Tang%22%7D%5D%2C%22abstractNote%22%3A%22Topic%20models%20have%20been%20successfully%20used%20in%20information%20classification%20and%20retrieval.%20These%20models%20can%20capture%20word%20correlations%20in%20a%20collection%20of%20textual%20documents%20with%20a%20low-dimensional%20set%20of%20multinomial%20distribution%2C%20called%20%5Cu201ctopics%5Cu201d.%20However%2C%20it%20is%20important%20but%20difficult%20to%20select%20the%20appropriate%20number%20of%20topics%20for%20a%20specific%20dataset.%20In%20this%20paper%2C%20we%20study%20the%20inherent%20connection%20between%20the%20best%20topic%20structure%20and%20the%20distances%20among%20topics%20in%20Latent%20Dirichlet%20allocation%20%28LDA%29%2C%20and%20propose%20a%20method%20of%20adaptively%20selecting%20the%20best%20LDA%20model%20based%20on%20density.%20Experiments%20show%20that%20the%20proposed%20method%20can%20achieve%20performance%20matching%20the%20best%20of%20LDA%20without%20manually%20tuning%20the%20number%20of%20topics.%22%2C%22date%22%3A%222009%22%2C%22language%22%3A%22en%22%2C%22DOI%22%3A%2210.1016%5C%2Fj.neucom.2008.06.011%22%2C%22ISSN%22%3A%220925-2312%22%2C%22url%22%3A%22http%3A%5C%2F%5C%2Fwww.sciencedirect.com%5C%2Fscience%5C%2Farticle%5C%2Fpii%5C%2FS092523120800372X%22%2C%22collections%22%3A%5B%5D%2C%22dateModified%22%3A%222019-07-27T21%3A26%3A11Z%22%2C%22tags%22%3A%5B%7B%22tag%22%3A%22Topic%20model%20optimization%22%7D%2C%7B%22tag%22%3A%22Topic%20modeling%22%7D%5D%7D%7D%2C%7B%22key%22%3A%22G59MALIL%22%2C%22library%22%3A%7B%22id%22%3A2133649%7D%2C%22meta%22%3A%7B%22lastModifiedByUser%22%3A%7B%22id%22%3A22837%2C%22username%22%3A%22ayliu%22%2C%22name%22%3A%22Alan%20Liu%22%2C%22links%22%3A%7B%22alternate%22%3A%7B%22href%22%3A%22https%3A%5C%2F%5C%2Fwww.zotero.org%5C%2Fayliu%22%2C%22type%22%3A%22text%5C%2Fhtml%22%7D%7D%7D%2C%22creatorSummary%22%3A%22AlSumait%20et%20al.%22%2C%22parsedDate%22%3A%222009%22%2C%22numChildren%22%3A3%7D%2C%22bib%22%3A%22%26lt%3Bdiv%20class%3D%26quot%3Bcsl-bib-body%26quot%3B%20style%3D%26quot%3Bline-height%3A%201.35%3B%20padding-left%3A%201em%3B%20text-indent%3A-1em%3B%26quot%3B%26gt%3B%5Cn%20%20%26lt%3Bdiv%20class%3D%26quot%3Bcsl-entry%26quot%3B%26gt%3BAlSumait%2C%20Loulwah%2C%20Daniel%20Barbar%26%23xE1%3B%2C%20James%20Gentle%2C%20and%20Carlotta%20Domeniconi.%20%26%23x201C%3BTopic%20Significance%20Ranking%20of%20LDA%20Generative%20Models.%26%23x201D%3B%20In%20%26lt%3Bi%26gt%3BProceedings%20of%20the%202009th%20European%20Conference%20on%20Machine%20Learning%20and%20Knowledge%20Discovery%20in%20Databases%20-%20Volume%20Part%20I%26lt%3B%5C%2Fi%26gt%3B%2C%2067%26%23x2013%3B82.%20ECMLPKDD%26%23x2019%3B09.%20Berlin%2C%20Heidelberg%3A%20Springer-Verlag%2C%202009.%20%26lt%3Ba%20class%3D%26%23039%3Bzp-ItemURL%26%23039%3B%20href%3D%26%23039%3Bhttps%3A%5C%2F%5C%2Fdoi.org%5C%2F10.1007%5C%2F978-3-642-04180-8_22%26%23039%3B%26gt%3Bhttps%3A%5C%2F%5C%2Fdoi.org%5C%2F10.1007%5C%2F978-3-642-04180-8_22%26lt%3B%5C%2Fa%26gt%3B.%20%26lt%3Ba%20title%3D%26%23039%3BCite%20in%20RIS%20Format%26%23039%3B%20class%3D%26%23039%3Bzp-CiteRIS%26%23039%3B%20data-zp-cite%3D%26%23039%3Bapi_user_id%3D2133649%26amp%3Bitem_key%3DG59MALIL%26%23039%3B%20href%3D%26%23039%3Bjavascript%3Avoid%280%29%3B%26%23039%3B%26gt%3BCite%26lt%3B%5C%2Fa%26gt%3B%20%26lt%3B%5C%2Fdiv%26gt%3B%5Cn%26lt%3B%5C%2Fdiv%26gt%3B%22%2C%22data%22%3A%7B%22itemType%22%3A%22conferencePaper%22%2C%22title%22%3A%22Topic%20Significance%20Ranking%20of%20LDA%20Generative%20Models%22%2C%22creators%22%3A%5B%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Loulwah%22%2C%22lastName%22%3A%22AlSumait%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Daniel%22%2C%22lastName%22%3A%22Barbar%5Cu00e1%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22James%22%2C%22lastName%22%3A%22Gentle%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Carlotta%22%2C%22lastName%22%3A%22Domeniconi%22%7D%5D%2C%22abstractNote%22%3A%22This%20paper%20presents%20the%20first%20automated%20unsupervised%20analysis%20of%20LDA%20models%20to%20identify%20junk%20topics%20from%20legitimate%20ones%2C%20and%20to%20rank%20the%20topic%20significance.%20Basically%2C%20the%20distance%20between%20a%20topic%20distribution%20and%20three%20definitions%20of%20%26quot%3Bjunk%20distribution%26quot%3B%20is%20computed%20using%20a%20variety%20of%20measures%2C%20from%20which%20an%20expressive%20figure%20of%20the%20topic%20significance%20is%20implemented%20using%204-phase%20Weighted%20Combination%20approach.%20These%20experiments%20show%20the%20effectiveness%20of%20the%20proposed%20approach%20in%20ranking%20the%20topic%20significance.%22%2C%22date%22%3A%222009%22%2C%22proceedingsTitle%22%3A%22Proceedings%20of%20the%202009th%20European%20Conference%20on%20Machine%20Learning%20and%20Knowledge%20Discovery%20in%20Databases%20-%20Volume%20Part%20I%22%2C%22conferenceName%22%3A%22%22%2C%22language%22%3A%22en%22%2C%22DOI%22%3A%2210.1007%5C%2F978-3-642-04180-8_22%22%2C%22ISBN%22%3A%22978-3-642-04179-2%22%2C%22url%22%3A%22https%3A%5C%2F%5C%2Fdoi.org%5C%2F10.1007%5C%2F978-3-642-04180-8_22%22%2C%22collections%22%3A%5B%5D%2C%22dateModified%22%3A%222019-07-27T21%3A47%3A21Z%22%2C%22tags%22%3A%5B%7B%22tag%22%3A%22Topic%20model%20optimization%22%7D%2C%7B%22tag%22%3A%22Topic%20modeling%22%7D%5D%7D%7D%2C%7B%22key%22%3A%22VVVQH74T%22%2C%22library%22%3A%7B%22id%22%3A2133649%7D%2C%22meta%22%3A%7B%22creatorSummary%22%3A%22Matveeva%20et%20al.%22%2C%22parsedDate%22%3A%222007%22%2C%22numChildren%22%3A0%7D%2C%22bib%22%3A%22%26lt%3Bdiv%20class%3D%26quot%3Bcsl-bib-body%26quot%3B%20style%3D%26quot%3Bline-height%3A%201.35%3B%20padding-left%3A%201em%3B%20text-indent%3A-1em%3B%26quot%3B%26gt%3B%5Cn%20%20%26lt%3Bdiv%20class%3D%26quot%3Bcsl-entry%26quot%3B%26gt%3BMatveeva%2C%20Irina%2C%20Gina-anne%20Levow%2C%20Ayman%20Farahat%2C%20and%20Christiaan%20Royer.%20%26%23x201C%3BTerms%20Representation%20with%20Generalized%20Latent%20Semantic%20Analysis.%26%23x201D%3B%20In%20%26lt%3Bi%26gt%3BRecent%20Advances%20in%20Natural%20Language%20Processing%20IV%3A%20Selected%20Papers%20from%20RANLP%202005%26lt%3B%5C%2Fi%26gt%3B%2C%20292%3A45%26%23x2013%3B54.%20Amsterdam%20Studies%20in%20the%20Theory%20and%20History%20of%20Linguistic%20Science.%20Amsterdam%3B%20Philadelphia%3A%20John%20Benjamins%2C%202007.%20%26lt%3Ba%20class%3D%26%23039%3Bzp-ItemURL%26%23039%3B%20href%3D%26%23039%3Bhttp%3A%5C%2F%5C%2Fciteseerx.ist.psu.edu%5C%2Fviewdoc%5C%2Fsummary%3Fdoi%3D10.1.1.110.2216%26%23039%3B%26gt%3Bhttp%3A%5C%2F%5C%2Fciteseerx.ist.psu.edu%5C%2Fviewdoc%5C%2Fsummary%3Fdoi%3D10.1.1.110.2216%26lt%3B%5C%2Fa%26gt%3B.%20%26lt%3Ba%20title%3D%26%23039%3BCite%20in%20RIS%20Format%26%23039%3B%20class%3D%26%23039%3Bzp-CiteRIS%26%23039%3B%20data-zp-cite%3D%26%23039%3Bapi_user_id%3D2133649%26amp%3Bitem_key%3DVVVQH74T%26%23039%3B%20href%3D%26%23039%3Bjavascript%3Avoid%280%29%3B%26%23039%3B%26gt%3BCite%26lt%3B%5C%2Fa%26gt%3B%20%26lt%3B%5C%2Fdiv%26gt%3B%5Cn%26lt%3B%5C%2Fdiv%26gt%3B%22%2C%22data%22%3A%7B%22itemType%22%3A%22bookSection%22%2C%22title%22%3A%22Terms%20representation%20with%20generalized%20latent%20semantic%20analysis%22%2C%22creators%22%3A%5B%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Irina%22%2C%22lastName%22%3A%22Matveeva%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Gina-anne%22%2C%22lastName%22%3A%22Levow%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Ayman%22%2C%22lastName%22%3A%22Farahat%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Christiaan%22%2C%22lastName%22%3A%22Royer%22%7D%5D%2C%22abstractNote%22%3A%22Document%20indexing%20and%20representation%20of%20termdocument%20relations%20are%20very%20important%20issues%20for%20document%20clustering%20and%20retrieval.%20In%20this%20paper%2C%20we%20present%20Generalized%20Latent%20Semantic%20Analysis%20as%20a%20framework%20for%20computing%20semantically%20motivated%20term%20and%20document%20vectors.%20Our%20focus%20on%20term%20vectors%20is%20motivated%20by%20the%20recent%20success%20of%20co-occurrence%20based%20measures%20of%20semantic%20similarity%20obtained%20from%20very%20large%20corpora.%20Our%20experiments%20demonstrate%20that%20GLSA%20term%20vectors%20efficiently%20capture%20semantic%20relations%20between%20terms%20and%20outperform%20related%20approaches%20on%20the%20synonymy%20test.%22%2C%22bookTitle%22%3A%22Recent%20Advances%20in%20Natural%20Language%20Processing%20IV%3A%20Selected%20Papers%20from%20RANLP%202005%22%2C%22date%22%3A%222007%22%2C%22language%22%3A%22en%22%2C%22ISBN%22%3A%22%22%2C%22url%22%3A%22http%3A%5C%2F%5C%2Fciteseerx.ist.psu.edu%5C%2Fviewdoc%5C%2Fsummary%3Fdoi%3D10.1.1.110.2216%22%2C%22collections%22%3A%5B%5D%2C%22dateModified%22%3A%222019-07-27T06%3A08%3A01Z%22%2C%22tags%22%3A%5B%7B%22tag%22%3A%22Topic%20model%20optimization%22%7D%5D%7D%7D%2C%7B%22key%22%3A%22PVHYW347%22%2C%22library%22%3A%7B%22id%22%3A2133649%7D%2C%22meta%22%3A%7B%22lastModifiedByUser%22%3A%7B%22id%22%3A22837%2C%22username%22%3A%22ayliu%22%2C%22name%22%3A%22Alan%20Liu%22%2C%22links%22%3A%7B%22alternate%22%3A%7B%22href%22%3A%22https%3A%5C%2F%5C%2Fwww.zotero.org%5C%2Fayliu%22%2C%22type%22%3A%22text%5C%2Fhtml%22%7D%7D%7D%2C%22creatorSummary%22%3A%22Newman%20et%20al.%22%2C%22parsedDate%22%3A%222006%22%2C%22numChildren%22%3A0%7D%2C%22bib%22%3A%22%26lt%3Bdiv%20class%3D%26quot%3Bcsl-bib-body%26quot%3B%20style%3D%26quot%3Bline-height%3A%201.35%3B%20padding-left%3A%201em%3B%20text-indent%3A-1em%3B%26quot%3B%26gt%3B%5Cn%20%20%26lt%3Bdiv%20class%3D%26quot%3Bcsl-entry%26quot%3B%26gt%3BNewman%2C%20David%2C%20Jey%20Han%20Lau%2C%20Karl%20Grieser%2C%20and%20Timothy%20Baldwin.%20%26%23x201C%3BAutomatic%20Evaluation%20of%20Topic%20Coherence.%26%23x201D%3B%20In%20%26lt%3Bi%26gt%3BCOLING-ACL%202006%3A%2021st%20International%20Conference%20on%20Computational%20Linguistics%20and%2044th%20Annual%20Meeting%20of%20the%20Association%20for%20Computational%20Linguistics%3B%2017%20-%2021%20July%202006%2C%20Sydney%2C%20Australia%3B%20Proceedings%20of%20the%20Conference.%20Vol.%201%26lt%3B%5C%2Fi%26gt%3B%2C%20Vol.%201.%20Stroudsburg%2C%20PA%3A%20Association%20for%20Computational%20Linguistics%2C%202006.%20%26lt%3Ba%20class%3D%26%23039%3Bzp-ItemURL%26%23039%3B%20href%3D%26%23039%3Bhttps%3A%5C%2F%5C%2Fdl.acm.org%5C%2Fcitation.cfm%3Fid%3D1858011%26%23039%3B%26gt%3Bhttps%3A%5C%2F%5C%2Fdl.acm.org%5C%2Fcitation.cfm%3Fid%3D1858011%26lt%3B%5C%2Fa%26gt%3B.%20%26lt%3Ba%20title%3D%26%23039%3BCite%20in%20RIS%20Format%26%23039%3B%20class%3D%26%23039%3Bzp-CiteRIS%26%23039%3B%20data-zp-cite%3D%26%23039%3Bapi_user_id%3D2133649%26amp%3Bitem_key%3DPVHYW347%26%23039%3B%20href%3D%26%23039%3Bjavascript%3Avoid%280%29%3B%26%23039%3B%26gt%3BCite%26lt%3B%5C%2Fa%26gt%3B%20%26lt%3B%5C%2Fdiv%26gt%3B%5Cn%26lt%3B%5C%2Fdiv%26gt%3B%22%2C%22data%22%3A%7B%22itemType%22%3A%22conferencePaper%22%2C%22title%22%3A%22Automatic%20evaluation%20of%20topic%20coherence%22%2C%22creators%22%3A%5B%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22David%22%2C%22lastName%22%3A%22Newman%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Jey%20Han%22%2C%22lastName%22%3A%22Lau%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Karl%22%2C%22lastName%22%3A%22Grieser%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Timothy%22%2C%22lastName%22%3A%22Baldwin%22%7D%5D%2C%22abstractNote%22%3A%22This%20paper%20introduces%20the%20novel%20task%20of%20topic%20coherence%20evaluation%2C%20whereby%20a%20set%20of%20words%2C%20as%20generated%20by%20a%20topic%20model%2C%20is%20rated%20for%20coherence%20or%20interpretability.%20The%20authors%20apply%20a%20range%20of%20topic%20scoring%20models%20to%20the%20evaluation%20task%2C%20drawing%20on%20WordNet%2C%20Wikipedia%20and%20the%20Google%20search%20engine%2C%20and%20existing%20research%20on%20lexical%20similarity%5C%2Frelatedness.%20In%20comparison%20with%20human%20scores%20for%20a%20set%20of%20learned%20topics%20over%20two%20distinct%20datasets%2C%20the%20authors%20show%20a%20simple%20co-occurrence%20measure%20based%20on%20pointwise%20mutual%20information%20over%20Wikipedia%20data%20is%20able%20to%20achieve%20results%20for%20the%20task%20at%20or%20nearing%20the%20level%20of%20inter-annotator%20correlation%2C%20and%20that%20other%20Wikipedia-based%20lexical%20relatedness%20methods%20also%20achieve%20strong%20results.%20Google%20produces%20strong%2C%20if%20less%20consistent%2C%20results%2C%20while%20their%20results%20over%20WordNet%20are%20patchy%20at%20best.%22%2C%22date%22%3A%222006%22%2C%22proceedingsTitle%22%3A%22COLING-ACL%202006%3A%2021st%20International%20Conference%20on%20Computational%20Linguistics%20and%2044th%20Annual%20Meeting%20of%20the%20Association%20for%20Computational%20Linguistics%3B%2017%20-%2021%20July%202006%2C%20Sydney%2C%20Australia%3B%20Proceedings%20of%20the%20Conference.%20Vol.%201%22%2C%22conferenceName%22%3A%22%22%2C%22language%22%3A%22en%22%2C%22DOI%22%3A%22%22%2C%22ISBN%22%3A%221-932432-65-5%22%2C%22url%22%3A%22https%3A%5C%2F%5C%2Fdl.acm.org%5C%2Fcitation.cfm%3Fid%3D1858011%22%2C%22collections%22%3A%5B%5D%2C%22dateModified%22%3A%222019-07-27T21%3A26%3A17Z%22%2C%22tags%22%3A%5B%7B%22tag%22%3A%22Topic%20model%20optimization%22%7D%2C%7B%22tag%22%3A%22Topic%20modeling%22%7D%5D%7D%7D%2C%7B%22key%22%3A%22MCBIJE58%22%2C%22library%22%3A%7B%22id%22%3A2133649%7D%2C%22meta%22%3A%7B%22lastModifiedByUser%22%3A%7B%22id%22%3A22837%2C%22username%22%3A%22ayliu%22%2C%22name%22%3A%22Alan%20Liu%22%2C%22links%22%3A%7B%22alternate%22%3A%7B%22href%22%3A%22https%3A%5C%2F%5C%2Fwww.zotero.org%5C%2Fayliu%22%2C%22type%22%3A%22text%5C%2Fhtml%22%7D%7D%7D%2C%22creatorSummary%22%3A%22Griffiths%20and%20Steyvers%22%2C%22parsedDate%22%3A%222004%22%2C%22numChildren%22%3A1%7D%2C%22bib%22%3A%22%26lt%3Bdiv%20class%3D%26quot%3Bcsl-bib-body%26quot%3B%20style%3D%26quot%3Bline-height%3A%201.35%3B%20padding-left%3A%201em%3B%20text-indent%3A-1em%3B%26quot%3B%26gt%3B%5Cn%20%20%26lt%3Bdiv%20class%3D%26quot%3Bcsl-entry%26quot%3B%26gt%3BGriffiths%2C%20T.%20L.%2C%20and%20M.%20Steyvers.%20%26%23x201C%3BFinding%20Scientific%20Topics.%26%23x201D%3B%20%26lt%3Bi%26gt%3BProceedings%20of%20the%20National%20Academy%20of%20Sciences%26lt%3B%5C%2Fi%26gt%3B%20101%2C%20no.%20Supplement%201%20%282004%29%3A%205228%26%23x2013%3B35.%20%26lt%3Ba%20class%3D%26%23039%3Bzp-DOIURL%26%23039%3B%20href%3D%26%23039%3Bhttps%3A%5C%2F%5C%2Fdoi.org%5C%2F10.1073%5C%2Fpnas.0307752101%26%23039%3B%26gt%3Bhttps%3A%5C%2F%5C%2Fdoi.org%5C%2F10.1073%5C%2Fpnas.0307752101%26lt%3B%5C%2Fa%26gt%3B.%20%26lt%3Ba%20title%3D%26%23039%3BCite%20in%20RIS%20Format%26%23039%3B%20class%3D%26%23039%3Bzp-CiteRIS%26%23039%3B%20data-zp-cite%3D%26%23039%3Bapi_user_id%3D2133649%26amp%3Bitem_key%3DMCBIJE58%26%23039%3B%20href%3D%26%23039%3Bjavascript%3Avoid%280%29%3B%26%23039%3B%26gt%3BCite%26lt%3B%5C%2Fa%26gt%3B%20%26lt%3B%5C%2Fdiv%26gt%3B%5Cn%26lt%3B%5C%2Fdiv%26gt%3B%22%2C%22data%22%3A%7B%22itemType%22%3A%22journalArticle%22%2C%22title%22%3A%22Finding%20scientific%20topics%22%2C%22creators%22%3A%5B%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22T.%20L.%22%2C%22lastName%22%3A%22Griffiths%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22M.%22%2C%22lastName%22%3A%22Steyvers%22%7D%5D%2C%22abstractNote%22%3A%22The%20plan%20of%20this%20article%20is%20as%20follows.%20In%20the%20next%20section%2C%20this%20article%20describes%20Latent%20Dirichlet%20Allocation%20and%20present%20a%20Markov%20chain%20Monte%20Carlo%20algorithm%20for%20inference%20in%20this%20model%2C%20illustrating%20the%20operation%20of%20our%20algorithm%20on%20a%20small%20dataset.%20It%20then%20applies%20an%20algorithm%20to%20a%20corpus%20consisting%20of%20abstracts%20from%20PNAS%20from%201991%20to%202001%2C%20determining%20the%20number%20of%20topics%20needed%20to%20account%20for%20the%20information%20contained%20in%20this%20corpus%20and%20extracting%20a%20set%20of%20topics.%20It%20uses%20these%20topics%20to%20illustrate%20the%20relationships%20between%20different%20scientific%20disciplines%2C%20assessing%20trends%20and%20%5Cu201chot%20topics%5Cu201d%20by%20analyzing%20topic%20dynamics%20and%20using%20the%20assignments%20of%20words%20to%20topics%20to%20highlight%20the%20semantic%20content%20of%20documents.%22%2C%22date%22%3A%222004%22%2C%22language%22%3A%22en%22%2C%22DOI%22%3A%2210.1073%5C%2Fpnas.0307752101%22%2C%22ISSN%22%3A%220027-8424%2C%201091-6490%22%2C%22url%22%3A%22http%3A%5C%2F%5C%2Fwww.pnas.org%5C%2Fcgi%5C%2Fdoi%5C%2F10.1073%5C%2Fpnas.0307752101%22%2C%22collections%22%3A%5B%5D%2C%22dateModified%22%3A%222019-07-27T21%3A33%3A41Z%22%2C%22tags%22%3A%5B%7B%22tag%22%3A%22Topic%20model%20optimization%22%7D%2C%7B%22tag%22%3A%22Topic%20modeling%22%7D%5D%7D%7D%5D%7D

Kapadia, Shashank. “Evaluate Topic Models: Latent Dirichlet Allocation (LDA).” Medium, 2020. https://towardsdatascience.com/evaluate-topic-model-in-python-latent-dirichlet-allocation-lda-7d57484bb5d0. Cite

Wikipedia. Confusion Matrix, 2019. https://en.wikipedia.org/w/index.php?title=Confusion_matrix&oldid=881721342. Cite

Syed, S., and M. Spruit. “Selecting Priors for Latent Dirichlet Allocation.” In 2018 IEEE 12th International Conference on Semantic Computing (ICSC), 194–202, 2018. https://doi.org/10.1109/ICSC.2018.00035. Cite

George, Clint P., and Hani Doss. “Principled Selection of Hyperparameters in the Latent Dirichlet Allocation Model.” Journal of Machine Learning Research 18, no. 162 (2018): 1–38. http://jmlr.org/papers/v18/15-595.html. Cite

Narkhede, Sarang. Understanding Confusion Matrix, 2018. https://towardsdatascience.com/understanding-confusion-matrix-a9ad42dcfd62. Cite

Dewi, Andisa, and Kilian Thiel. “Topic Extraction: Optimizing the Number of Topics with the Elbow Method.” KNIME (blog), 2017. https://www.knime.com/blog/topic-extraction-optimizing-the-number-of-topics-with-the-elbow-method. Cite

Ellis, Peter. Cross-Validation of Topic Modelling, 2017. http://freerangestats.info/blog/2017/01/05/topic-model-cv.html. Cite

Wisdom, Alyssa. Topic Modeling, 2017. https://medium.com/square-corner-blog/topic-modeling-optimizing-for-human-interpretability-48a81f6ce0ed. Cite

Schofield, Alexandra, Laure Thompson, and David Mimno. “Quantifying the Effects of Text Duplication on Semantic Models.” In Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, 2737–47. Copenhagen, Denmark: Association for Computational Linguistics, 2017. https://doi.org/10.18653/v1/D17-1290. Cite

Schöch, Christof. Topic Modeling with MALLET: Hyperparameter Optimization, 2016. https://dragonfly.hypotheses.org/1051. Cite

Soltoff, Benjamin. Text Analysis: Topic Modeling, 2016. https://cfss.uchicago.edu/fall2016/text02.html. Cite

Allahyari, Mehdi, and Krys Kochut. “Discovering Coherent Topics with Entity Topic Models.” In 2016 IEEE/WIC/ACM International Conference on Web Intelligence (WI), 26–33. Omaha, NE, USA: IEEE, 2016. https://doi.org/10.1109/WI.2016.0015. Cite

Alexander, Eric, and Michael Gleicher. “Task-Driven Comparison of Topic Models.” IEEE Transactions on Visualization and Computer Graphics 22, no. 1 (2016): 320–29. https://doi.org/10.1109/TVCG.2015.2467618. Cite

Murdock, Jaimie, and Colin Allen. “Visualization Techniques for Topic Model Checking.” In Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, 4284–85. AAAI’15. Austin, Texas: AAAI Press, 2015. http://dl.acm.org/citation.cfm?id=2888116.2888368. Cite

Chuang, Jason, Margaret E. Roberts, Brandon M. Stewart, Rebecca Weiss, Dustin Tingley, Justin Grimmer, and Jeffrey Heer. “TopicCheck: Interactive Alignment for Assessing Topic Model Stability.” In Proceedings of the 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 175–84. Denver: Association for Computational Linguistics, 2015. https://doi.org/10.3115/v1/N15-1018. Cite

Boyd-Graber, Jordan, David Mimno, and David Newman. “Care and Feeding of Topic Models: Problems, Diagnostics, and Improvements.” Handbook of Mixed Membership Models and Their Applications, 2014. https://doi.org/10.1201/b17520-21. Cite

Sievert, Carson, and Kenneth R. Shirley. “LDAvis : A Method for Visualizing and Interpreting Topics.” In Proceedings of the Workshop on Interactive Language Learning, Visualization, and Interfaces, 63–70. Association for Computational Linguistics, 2014. http://www.aclweb.org/anthology/W14-3110. Cite

Evans, Michael S. “A Computational Approach to Qualitative Analysis in Large Textual Datasets.” PLOS ONE 9, no. 2 (2014): e87908. https://doi.org/10.1371/journal.pone.0087908. Cite

Chuang, Jason, Sonal Gupta, Christopher D. Manning, and Jeffrey Heer. “Topic Model Diagnostics: Assessing Domain Relevance via Topical Alignment.” In Proceedings of the 30th International Conference on International Conference on Machine Learning - Volume 28, III-612-III–620. ICML’13. Atlanta, GA, USA: JMLR.org, 2013. http://dl.acm.org/citation.cfm?id=3042817.3043005. Cite

Chen, Zhiyuan, Arjun Mukherjee, Bing Liu, Meichun Hsu, Malu Castellanos, and Riddhiman Ghosh. “Discovering Coherent Topics Using General Knowledge.” In Proceedings of the 22Nd ACM International Conference on Information & Knowledge Management, 209–18. CIKM ’13. New York, NY, USA: ACM, 2013. https://doi.org/10.1145/2505515.2505519. Cite

Chuang, Jason, Daniel Ramage, Christopher Manning, and Jeffrey Heer. “Interpretation and Trust: Designing Model-Driven Visualizations for Text Analysis.” In Proceedings of the SIGCHI Conference on Human Factors in Computing Systems, 443–52. CHI ’12. New York, NY, USA: ACM, 2012. https://doi.org/10.1145/2207676.2207738. Cite

Ponweiser, Martin. “Latent Dirichlet Allocation in R.” Diploma Thesis, Vienna University of Economics and Business, 2012. http://epub.wu.ac.at/3558/. Cite

Matt Taddy. “On Estimation and Selection for Topic Models.” In Proceedings of the Fifteenth International Conference on Artificial Intelligence and Statistics, edited by Neil D. Lawrence and Mark Girolami, 1184–93. PMLR, 2012. http://proceedings.mlr.press/v22/taddy12.html. Cite

Bischof, Jonathan M., and Edoardo M. Airoldi. “Summarizing Topical Content with Word Frequency and Exclusivity.” In Proceedings of the 29th International Coference on International Conference on Machine Learning, 9–16. ICML’12. USA: Omnipress, 2012. https://icml.cc/2012/papers/113.pdf. Cite

Mimno, David, and David Blei. “Bayesian Checking for Topic Models.” In Proceedings of the Conference on Empirical Methods in Natural Language Processing, 227–37. Association for Computational Linguistics, 2011. Cite

Gavagai (company). “The Advantage of Ethersource on the TOEFL Synonym Test Compared to Other Methods.” Gavagai (blog), 2011. https://www.gavagai.io/blog/2011/12/14/the-advantage-of-ethersource-on-the-toefl-synonym-test-compared-to-other-methods/. Cite

Ratinov, Lev, Dan Roth, Doug Downey, and Mike Anderson. “Local and Global Algorithms for Disambiguation to Wikipedia,” 1375–84, 2011. https://aclweb.org/anthology/papers/P/P11/P11-1138/. Cite

Arun, R., V. Suresh, C. E. Veni Madhavan, and M. N. Narasimha Murthy. “On Finding the Natural Number of Topics with Latent Dirichlet Allocation: Some Observations.” In Advances in Knowledge Discovery and Data Mining, edited by Mohammed J. Zaki, Jeffrey Xu Yu, B. Ravindran, and Vikram Pudi, 391–402. Lecture Notes in Computer Science. Springer Berlin Heidelberg, 2010. Cite

Cao, Juan, Tian Xia, Jintao Li, Yongdong Zhang, and Sheng Tang. “A Density-Based Method for Adaptive LDA Model Selection.” Neurocomputing, Advances in Machine Learning and Computational Intelligence, 72, no. 7 (2009): 1775–81. https://doi.org/10.1016/j.neucom.2008.06.011. Cite

AlSumait, Loulwah, Daniel Barbará, James Gentle, and Carlotta Domeniconi. “Topic Significance Ranking of LDA Generative Models.” In Proceedings of the 2009th European Conference on Machine Learning and Knowledge Discovery in Databases - Volume Part I, 67–82. ECMLPKDD’09. Berlin, Heidelberg: Springer-Verlag, 2009. https://doi.org/10.1007/978-3-642-04180-8_22. Cite

Matveeva, Irina, Gina-anne Levow, Ayman Farahat, and Christiaan Royer. “Terms Representation with Generalized Latent Semantic Analysis.” In Recent Advances in Natural Language Processing IV: Selected Papers from RANLP 2005, 292:45–54. Amsterdam Studies in the Theory and History of Linguistic Science. Amsterdam; Philadelphia: John Benjamins, 2007. http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.110.2216. Cite

Newman, David, Jey Han Lau, Karl Grieser, and Timothy Baldwin. “Automatic Evaluation of Topic Coherence.” In COLING-ACL 2006: 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics; 17 - 21 July 2006, Sydney, Australia; Proceedings of the Conference. Vol. 1, Vol. 1. Stroudsburg, PA: Association for Computational Linguistics, 2006. https://dl.acm.org/citation.cfm?id=1858011. Cite

Griffiths, T. L., and M. Steyvers. “Finding Scientific Topics.” Proceedings of the National Academy of Sciences 101, no. Supplement 1 (2004): 5228–35. https://doi.org/10.1073/pnas.0307752101. Cite