Selected DH research and resources bearing on, or utilized by, the WE1S project.
Topic Model Multilingual
Hao, Shudong, and Michael J. Paul. “An Empirical Study on Crosslingual Transfer in Probabilistic Topic Models.” arXiv, 2019. https://www.semanticscholar.org/paper/An-Empirical-Study-on-Crosslingual-Transfer-in-Hao-Paul/958506be9d5789b48ab89e95b29f56701d45e46a.
Abstract: A systematical study of the knowledge transfer mechanisms behind different multilingual topic models, and through a broad set of experiments with four models on ten languages. It provides empirical insights that can inform the selection and future development of multilingual topic models.

Krstovski, Kriste, Michael J. Kurtz, David A. Smith, and Alberto Accomazzi. “Multilingual Topic Models.” ArXiv:1712.06704 [Cs, Stat], 2017. http://arxiv.org/abs/1712.06704.
Abstract: Scientific publications have evolved several features for mitigating vocabulary mismatch when indexing, retrieving, and computing similarity between articles. These mitigation strategies range from simply focusing on high-value article sections, such as titles and abstracts, to assigning keywords, often from controlled vocabularies, either manually or through automatic annotation. Various document representation schemes possess different cost-benefit tradeoffs. In this paper, we propose to model different representations of the same article as translations of each other, all generated from a common latent representation in a multilingual topic model. We start with a methodological overview on latent variable models for parallel document representations that could be used across many information science tasks. We then show how solving the inference problem of mapping diverse representations into a shared topic space allows us to evaluate representations based on how topically similar they are to the original article. In addition, our proposed approach provides means to discover where different concept vocabularies require improvement.

Gutiérrez, E. D., Ekaterina Shutova, Patricia Lichtenstein, Gerard de Melo, and Luca Gilardi. “Detecting Cross-Cultural Differences Using a Multilingual Topic Model.” Transactions of the Association for Computational Linguistics 4 (2016): 47–60. https://tacl2013.cs.columbia.edu/ojs/index.php/tacl/article/view/755.
Abstract: Understanding cross-cultural differences has important implications for world affairs and many aspects of the life of society. Yet, the majority of text-mining methods to date focus on the analysis of monolingual texts. In contrast, this paper presents a statistical model that simultaneously learns a set of common topics from multilingual, non-parallel data and automatically discovers the differences in perspectives in these topics across linguistic communities. This paper performs a behavioural evaluation of a subset of the differences identified by our model in English and Spanish to investigate their psychological validity.

Vulić, Ivan, Wim De Smet, Jie Tang, and Marie-Francine Moens. “Probabilistic Topic Modeling in Multilingual Settings: An Overview of Its Methodology and Applications.” Information Processing & Management 51, no. 1 (2015): 111–47. http://keg.cs.tsinghua.edu.cn/jietang/publications/ipm15-xLiTe-IvanVulic-overview-topic-model-multilingual.pdf.
Abstract: Probabilistic topic models are unsupervised generative models which model document content as a two-step generation process, that is, documents are observed as mixtures of latent concepts or topics, while topics are probability distributions over vocabulary words. Recently, a significant research effort has been invested into transferring the probabilistic topic modeling concept from monolingual to multilingual settings. Novel topic models have been designed to work with parallel and comparable texts. We define multilingual probabilistic topic modeling (MuPTM) and present the first full overview of the current research, methodology, advantages and limitations in MuPTM. As a representative example, we choose a natural extension of the omnipresent LDA model to multilingual settings called bilingual LDA (BiLDA). We provide a thorough overview of this representative multilingual model from its high-level modeling assumptions down to its mathematical foundations. We demonstrate how to use the data representation by means of output sets of (i) per-topic word distributions and (ii) per-document topic distributions coming from a multilingual probabilistic topic model in various real-life cross-lingual tasks involving different languages, without any external language pair dependent translation resource: (1) cross-lingual event-centered news clustering, (2) cross-lingual document classification, (3) cross-lingual semantic similarity, and (4) cross-lingual information retrieval. We also briefly review several other applications present in the relevant literature, and introduce and illustrate two related modeling concepts: topic smoothing and topic pruning. In summary, this article encompasses the current research in multilingual probabilistic topic modeling. By presenting a series of potential applications, we reveal the importance of the language-independent and language pair independent data representations by means of MuPTM. We provide clear directions for future research in the field by providing a systematic overview of how to link and transfer aspect knowledge across corpora written in different languages via the shared space of latent cross-lingual topics, that is, how to effectively employ learned per-topic word distributions and per-document topic distributions of any multilingual probabilistic topic model in various cross-lingual applications.

Moens, Marie-Francine, and Ivan Vulić. “Monolingual and Cross-Lingual Probabilistic Topic Models and Their Applications in Information Retrieval.” In Advances in Information Retrieval, edited by David Hutchison, Takeo Kanade, Josef Kittler, Jon M. Kleinberg, Friedemann Mattern, John C. Mitchell, Moni Naor, et al., 7814:874–77. Berlin, Heidelberg: Springer Berlin Heidelberg, 2013. https://doi.org/10.1007/978-3-642-36973-5_106.
Abstract: Significant research effort has been invested into transferring the probabilistic topic modeling concept from monolingual to multilingual settings. Novel topic models have been designed to work with parallel and comparable multilingual data (e.g., Wikipedia or news data discussing the same events). Probabilistic topics are an easy integration into a language modeling framework for monolingual and crosslingual information retrieval. Moreover, this paper presents how to use the knowledge from the topic models in the tasks of cross-lingual event clustering, cross-lingual document classification and the detection of cross-lingual semantic similarity of words. The tutorial also demonstrates how semantically similar words across languages are integrated as useful additional evidences in cross-lingual information.

Jagarlamudi, Jagadeesh, and Hal Daumé. “Extracting Multilingual Topics from Unaligned Comparable Corpora.” In Proceedings of the 32nd European Conference on Advances in Information Retrieval, 444–56. ECIR’2010. Berlin, Heidelberg: Springer-Verlag, 2010. https://doi.org/10.1007/978-3-642-12275-0_39.
Abstract: This paper presents a generative model called JointLDA which uses a bilingual dictionary to mine multilingual topics from an unaligned corpus. Experiments conducted on different data sets confirm our conjecture that jointly modeling the cross-lingual corpora offers several advantages compared to individual monolingual models. Since the JointLDA model merges related topics in different languages into a single multilingual topic: a) it can fit the data with relatively fewer topics. b) it has the ability to predict related words from a language different than that of the given document. In fact it has better predictive power compared to the bag-of-word based translation model leaving the possibility for JointLDA to be preferred over bag-of-word model for Cross-Lingual IR applications. Furthermore, monolingual models learnt while optimizing the cross-lingual copora are more effective than the corresponding LDA models.

Boyd-Graber, Jordan, and Philip Resnik. “Holistic Sentiment Analysis across Languages: Multilingual Supervised Latent Dirichlet Allocation.” In Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, 45–55. Association for Computational Linguistics, 2010. http://www.aclweb.org/anthology/D10-1005.
Abstract: This paper develops multilingual supervised latent Dirichlet allocation (MlSLDA), a probabilistic generative model that allows insights gleaned from one language's data to inform how the model captures properties of other languages. This work shows MlSLDA can build topics that are consistent across languages, discover sensible bilingual lexical correspondences, and leverage multilingual corpora to better predict sentiment.

Boyd-Graber, Jordan, and David M. Blei. “Multilingual Topic Models for Unaligned Text.” In Proceedings of the Twenty-Fifth Conference on Uncertainty in Artificial Intelligence, 75–82. AUAI Press, 2009. https://arxiv.org/pdf/1205.2657.
Abstract: The multilingual topic model for unaligned text (MuTo) is a probabilistic model of text that is designed to analyze corpora composed of documents in two languages. From these documents, MuTo uses stochastic EM to simultaneously discover both a matchi
ng%20between%20the%20languages%20and%20multilingual%20latent%20topics.%20This%20study%20demonstrates%20that%20MuTo%20is%20able%20to%20find%20shared%20topics%20on%20real-world%20multilingual%20corpora%2C%20successfully%20pairing%20related%20documents%20across%20languages.%20MuTo%20provides%20a%20new%20framework%20for%20creating%20multilingual%20topic%20models%20without%20needing%20carefully%20curated%20parallel%20corpora%20and%20allows%20applications%20built%20using%20the%20topic%20model%20formalism%20to%20be%22%2C%22date%22%3A%222009%22%2C%22proceedingsTitle%22%3A%22Proceedings%20of%20the%20Twenty-Fifth%20Conference%20on%20Uncertainty%20in%20Artificial%20Intelligence%22%2C%22conferenceName%22%3A%22%22%2C%22language%22%3A%22en%22%2C%22DOI%22%3A%22%22%2C%22ISBN%22%3A%22%22%2C%22url%22%3A%22https%3A%5C%2F%5C%2Farxiv.org%5C%2Fpdf%5C%2F1205.2657%22%2C%22collections%22%3A%5B%5D%2C%22dateModified%22%3A%222019-07-27T21%3A44%3A05Z%22%2C%22tags%22%3A%5B%7B%22tag%22%3A%22Topic%20model%20algorithm%22%7D%2C%7B%22tag%22%3A%22Topic%20model%20multilingual%22%7D%2C%7B%22tag%22%3A%22Topic%20modeling%22%7D%5D%7D%7D%2C%7B%22key%22%3A%22S7U7K7RH%22%2C%22library%22%3A%7B%22id%22%3A2133649%7D%2C%22meta%22%3A%7B%22lastModifiedByUser%22%3A%7B%22id%22%3A22837%2C%22username%22%3A%22ayliu%22%2C%22name%22%3A%22Alan%20Liu%22%2C%22links%22%3A%7B%22alternate%22%3A%7B%22href%22%3A%22https%3A%5C%2F%5C%2Fwww.zotero.org%5C%2Fayliu%22%2C%22type%22%3A%22text%5C%2Fhtml%22%7D%7D%7D%2C%22creatorSummary%22%3A%22Mimno%20et%20al.%22%2C%22parsedDate%22%3A%222009%22%2C%22numChildren%22%3A0%7D%2C%22bib%22%3A%22%3Cdiv%20class%3D%5C%22csl-bib-body%5C%22%20style%3D%5C%22line-height%3A%201.35%3B%20padding-left%3A%201em%3B%20text-indent%3A-1em%3B%5C%22%3E%5Cn%20%20%3Cdiv%20class%3D%5C%22csl-entry%5C%22%3EMimno%2C%20David%2C%20Hanna%20M.%20Wallach%2C%20Jason%20Naradowsky%2C%20David%20A.%20Smith%2C%20and%20Andrew%20McCallum.%20%26%23x201C%3BPolylingual%20Topic%20Models.%26%23x201D%3B%20In%20%3Ci%3EProceedings%20of%20th
e%202009%20Conference%20on%20Empirical%20Methods%20in%20Natural%20Language%20Processing%3A%20Volume%202%20-%20Volume%202%3C%5C%2Fi%3E%2C%20880%26%23x2013%3B89.%20EMNLP%20%26%23x2019%3B09.%20Stroudsburg%2C%20PA%2C%20USA%3A%20Association%20for%20Computational%20Linguistics%2C%202009.%20%3Ca%20class%3D%27zp-ItemURL%27%20href%3D%27http%3A%5C%2F%5C%2Fdl.acm.org%5C%2Fcitation.cfm%3Fid%3D1699571.1699627%27%3Ehttp%3A%5C%2F%5C%2Fdl.acm.org%5C%2Fcitation.cfm%3Fid%3D1699571.1699627%3C%5C%2Fa%3E.%20%3Ca%20title%3D%27Cite%20in%20RIS%20Format%27%20class%3D%27zp-CiteRIS%27%20href%3D%27https%3A%5C%2F%5C%2Fwe1s.ucsb.edu%5C%2Fwp-content%5C%2Fplugins%5C%2Fzotpress%5C%2Flib%5C%2Frequest%5C%2Frequest.cite.php%3Fapi_user_id%3D2133649%26amp%3Bitem_key%3DS7U7K7RH%27%3ECite%3C%5C%2Fa%3E%20%3C%5C%2Fdiv%3E%5Cn%3C%5C%2Fdiv%3E%22%2C%22data%22%3A%7B%22itemType%22%3A%22conferencePaper%22%2C%22title%22%3A%22Polylingual%20Topic%20Models%22%2C%22creators%22%3A%5B%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22David%22%2C%22lastName%22%3A%22Mimno%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Hanna%20M.%22%2C%22lastName%22%3A%22Wallach%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Jason%22%2C%22lastName%22%3A%22Naradowsky%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22David%20A.%22%2C%22lastName%22%3A%22Smith%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Andrew%22%2C%22lastName%22%3A%22McCallum%22%7D%5D%2C%22abstractNote%22%3A%22Topic%20models%20are%20a%20useful%20tool%20for%20analyzing%20large%20text%20collections%2C%20but%20have%20previously%20been%20applied%20in%20only%20monolingual%2C%20or%20at%20most%20bilingual%2C%20contexts.%20Meanwhile%2C%20massive%20collections%20of%20interlinked%20documents%20in%20dozens%20of%20languages%2C%20such%20as%20Wikipedia%2C%20are%20now%20widely%20available%2C%20calling%20for%20tools%20that%20can%20characterize%20content%20in%20many%20languages.%20This%20paper%20introduces%
20a%20polylingual%20topic%20model%20that%20discovers%20topics%20aligned%20across%20multiple%20languages.%20It%20explores%20the%20model%27s%20characteristics%20using%20two%20large%20corpora%2C%20each%20with%20over%20ten%20different%20languages%2C%20and%20it%20demonstrates%20its%20usefulness%20in%20supporting%20machine%20translation%20and%20tracking%20topic%20trends%20across%20languages.%22%2C%22date%22%3A%222009%22%2C%22proceedingsTitle%22%3A%22Proceedings%20of%20the%202009%20Conference%20on%20Empirical%20Methods%20in%20Natural%20Language%20Processing%3A%20Volume%202%20-%20Volume%202%22%2C%22conferenceName%22%3A%22%22%2C%22language%22%3A%22en%22%2C%22DOI%22%3A%22%22%2C%22ISBN%22%3A%22978-1-932432-62-6%22%2C%22url%22%3A%22http%3A%5C%2F%5C%2Fdl.acm.org%5C%2Fcitation.cfm%3Fid%3D1699571.1699627%22%2C%22collections%22%3A%5B%5D%2C%22dateModified%22%3A%222020-07-01T20%3A53%3A02Z%22%2C%22tags%22%3A%5B%7B%22tag%22%3A%22Topic%20model%20multilingual%22%7D%2C%7B%22tag%22%3A%22Topic%20modeling%22%7D%5D%7D%7D%2C%7B%22key%22%3A%22HU6VNENC%22%2C%22library%22%3A%7B%22id%22%3A2133649%7D%2C%22meta%22%3A%7B%22lastModifiedByUser%22%3A%7B%22id%22%3A22837%2C%22username%22%3A%22ayliu%22%2C%22name%22%3A%22Alan%20Liu%22%2C%22links%22%3A%7B%22alternate%22%3A%7B%22href%22%3A%22https%3A%5C%2F%5C%2Fwww.zotero.org%5C%2Fayliu%22%2C%22type%22%3A%22text%5C%2Fhtml%22%7D%7D%7D%2C%22creatorSummary%22%3A%22Larkey%20et%20al.%22%2C%22parsedDate%22%3A%222004%22%2C%22numChildren%22%3A2%7D%2C%22bib%22%3A%22%3Cdiv%20class%3D%5C%22csl-bib-body%5C%22%20style%3D%5C%22line-height%3A%201.35%3B%20padding-left%3A%201em%3B%20text-indent%3A-1em%3B%5C%22%3E%5Cn%20%20%3Cdiv%20class%3D%5C%22csl-entry%5C%22%3ELarkey%2C%20Leah%20S.%2C%20Fangfang%20Feng%2C%20Margaret%20Connell%2C%20and%20Victor%20Lavrenko.%20%26%23x201C%3BLanguage-Specific%20Models%20in%20Multilingual%20Topic%20Tracking.%26%23x201D%3B%20In%20%3Ci%3EProceedings%20of%20the%2027th%20Annual%20International%20ACM%20SIGIR%20Conference%20on%20Research%20and%20Devel
opment%20in%20Information%20Retrieval%3C%5C%2Fi%3E%2C%20402%26%23x2013%3B9.%20SIGIR%20%26%23x2019%3B04.%20New%20York%2C%20NY%2C%20USA%3A%20ACM%2C%202004.%20%3Ca%20class%3D%27zp-DOIURL%27%20href%3D%27https%3A%5C%2F%5C%2Fdoi.org%5C%2F10.1145%5C%2F1008992.1009061%27%3Ehttps%3A%5C%2F%5C%2Fdoi.org%5C%2F10.1145%5C%2F1008992.1009061%3C%5C%2Fa%3E.%20%3Ca%20title%3D%27Cite%20in%20RIS%20Format%27%20class%3D%27zp-CiteRIS%27%20href%3D%27https%3A%5C%2F%5C%2Fwe1s.ucsb.edu%5C%2Fwp-content%5C%2Fplugins%5C%2Fzotpress%5C%2Flib%5C%2Frequest%5C%2Frequest.cite.php%3Fapi_user_id%3D2133649%26amp%3Bitem_key%3DHU6VNENC%27%3ECite%3C%5C%2Fa%3E%20%3C%5C%2Fdiv%3E%5Cn%3C%5C%2Fdiv%3E%22%2C%22data%22%3A%7B%22itemType%22%3A%22conferencePaper%22%2C%22title%22%3A%22Language-specific%20Models%20in%20Multilingual%20Topic%20Tracking%22%2C%22creators%22%3A%5B%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Leah%20S.%22%2C%22lastName%22%3A%22Larkey%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Fangfang%22%2C%22lastName%22%3A%22Feng%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Margaret%22%2C%22lastName%22%3A%22Connell%22%7D%2C%7B%22creatorType%22%3A%22author%22%2C%22firstName%22%3A%22Victor%22%2C%22lastName%22%3A%22Lavrenko%22%7D%5D%2C%22abstractNote%22%3A%22Researchers%20have%20trained%20only%20English%20topic%20models%20because%20the%20training%20stories%20have%20been%20provided%20in%20English%20and%20tracking%20can%20be%20complicated%20when%20the%20stories%20are%20in%20multiple%20languages.%20In%20tracking%2C%20non-English%20test%20stories%20are%20then%20machine%20translated%20into%20English%20to%20compare%20them%20with%20the%20topic%20models.%20In%20this%20paper%20the%20authors%20propose%20a%20native%20language%20hypothesis%20stating%20that%20comparisons%20would%20be%20more%20effective%20in%20the%20original%20language%20of%20the%20story.%20First%2C%20they%20test%20and%20support%20the%20hypothesis%20for%20story%20link%20detection.%20For%20topic%20trac
king%20the%20hypothesis%20implies%20that%20it%20should%20be%20preferable%20to%20build%20separate%20language-specific%20topic%20models%20for%20each%20language%20in%20the%20stream.%20Different%20methods%20of%20incrementally%20building%20such%20native%20language%20topic%20models%20are%20then%20compared.%22%2C%22date%22%3A%222004%22%2C%22proceedingsTitle%22%3A%22Proceedings%20of%20the%2027th%20Annual%20International%20ACM%20SIGIR%20Conference%20on%20Research%20and%20Development%20in%20Information%20Retrieval%22%2C%22conferenceName%22%3A%22%22%2C%22language%22%3A%22en%22%2C%22DOI%22%3A%2210.1145%5C%2F1008992.1009061%22%2C%22ISBN%22%3A%22978-1-58113-881-8%22%2C%22url%22%3A%22http%3A%5C%2F%5C%2Fdoi.acm.org%5C%2F10.1145%5C%2F1008992.1009061%22%2C%22collections%22%3A%5B%5D%2C%22dateModified%22%3A%222019-07-27T21%3A43%3A09Z%22%2C%22tags%22%3A%5B%7B%22tag%22%3A%22Topic%20model%20multilingual%22%7D%2C%7B%22tag%22%3A%22Topic%20modeling%22%7D%5D%7D%7D%5D%7D
Hao, Shudong, and Michael J. Paul. “An Empirical Study on Crosslingual Transfer in Probabilistic Topic Models.” arXiv, 2019. https://www.semanticscholar.org/paper/An-Empirical-Study-on-Crosslingual-Transfer-in-Hao-Paul/958506be9d5789b48ab89e95b29f56701d45e46a.
Krstovski, Kriste, Michael J. Kurtz, David A. Smith, and Alberto Accomazzi. “Multilingual Topic Models.” arXiv:1712.06704 [cs, stat], 2017. http://arxiv.org/abs/1712.06704.
Gutiérrez, E. D., Ekaterina Shutova, Patricia Lichtenstein, Gerard de Melo, and Luca Gilardi. “Detecting Cross-Cultural Differences Using a Multilingual Topic Model.” Transactions of the Association for Computational Linguistics 4 (2016): 47–60. https://tacl2013.cs.columbia.edu/ojs/index.php/tacl/article/view/755.
Vulić, Ivan, Wim De Smet, Jie Tang, and Marie-Francine Moens. “Probabilistic Topic Modeling in Multilingual Settings: An Overview of Its Methodology and Applications.” Information Processing & Management 51, no. 1 (2015): 111–47. http://keg.cs.tsinghua.edu.cn/jietang/publications/ipm15-xLiTe-IvanVulic-overview-topic-model-multilingual.pdf.
Moens, Marie-Francine, and Ivan Vulić. “Monolingual and Cross-Lingual Probabilistic Topic Models and Their Applications in Information Retrieval.” In Advances in Information Retrieval, edited by David Hutchison, Takeo Kanade, Josef Kittler, Jon M. Kleinberg, Friedemann Mattern, John C. Mitchell, Moni Naor, et al., 7814:874–77. Berlin, Heidelberg: Springer Berlin Heidelberg, 2013. https://doi.org/10.1007/978-3-642-36973-5_106.
Jagarlamudi, Jagadeesh, and Hal Daumé. “Extracting Multilingual Topics from Unaligned Comparable Corpora.” In Proceedings of the 32nd European Conference on Advances in Information Retrieval, 444–56. ECIR’2010. Berlin, Heidelberg: Springer-Verlag, 2010. https://doi.org/10.1007/978-3-642-12275-0_39.
Boyd-Graber, Jordan, and Philip Resnik. “Holistic Sentiment Analysis across Languages: Multilingual Supervised Latent Dirichlet Allocation.” In Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, 45–55. Association for Computational Linguistics, 2010. http://www.aclweb.org/anthology/D10-1005.
Boyd-Graber, Jordan, and David M. Blei. “Multilingual Topic Models for Unaligned Text.” In Proceedings of the Twenty-Fifth Conference on Uncertainty in Artificial Intelligence, 75–82. AUAI Press, 2009. https://arxiv.org/pdf/1205.2657.
Mimno, David, Hanna M. Wallach, Jason Naradowsky, David A. Smith, and Andrew McCallum. “Polylingual Topic Models.” In Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 2, 880–89. EMNLP ’09. Stroudsburg, PA, USA: Association for Computational Linguistics, 2009. http://dl.acm.org/citation.cfm?id=1699571.1699627.
Larkey, Leah S., Fangfang Feng, Margaret Connell, and Victor Lavrenko. “Language-Specific Models in Multilingual Topic Tracking.” In Proceedings of the 27th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 402–9. SIGIR ’04. New York, NY, USA: ACM, 2004. https://doi.org/10.1145/1008992.1009061.