TY - GEN
T1 - Misinformation Detection Algorithms and Fairness across Political Ideologies
T2 - 15th ACM Web Science Conference, WebSci 2023
AU - Park, Jinkyung
AU - Ellezhuthil, Rahul Dev
AU - Isaac, Joseph
AU - Mergerson, Christoph
AU - Feldman, Lauren
AU - Singh, Vivek
N1 - Funding Information:
This material is in part based upon work supported by the National Science Foundation under Grant No. SES-1915790.
Publisher Copyright:
© 2023 ACM.
PY - 2023/4/30
Y1 - 2023/4/30
N2 - Multiple recent efforts have used large-scale data and computational models to automatically detect misinformation in online news articles. Given the potential impact of misinformation on democracy, many of these efforts have also used the political ideology of these articles to better model misinformation and study political bias in such algorithms. However, almost all such efforts have used source level labels for credibility and political alignment, thereby assigning the same credibility and political alignment label to all articles from the same source (e.g., the New York Times or Breitbart). Here, we report on the impact of journalistic best practices to label individual news articles for their credibility and political alignment. We found that while source level labels are decent proxies for political alignment labeling, they are very poor proxies-almost the same as flipping a coin-for credibility ratings. Next, we study the implications of such source level labeling on downstream processes such as the development of automated misinformation detection algorithms and political fairness audits therein. We find that the automated misinformation detection and fairness algorithms can be suitably revised to support their intended goals but might require different assumptions and methods than those which are appropriate using source level labeling. The results suggest caution in generalizing recent results on misinformation detection and political bias therein. On a positive note, this work shares a new dataset of journalistic quality individually labeled articles and an approach for misinformation detection and fairness audits.
AB - Multiple recent efforts have used large-scale data and computational models to automatically detect misinformation in online news articles. Given the potential impact of misinformation on democracy, many of these efforts have also used the political ideology of these articles to better model misinformation and study political bias in such algorithms. However, almost all such efforts have used source level labels for credibility and political alignment, thereby assigning the same credibility and political alignment label to all articles from the same source (e.g., the New York Times or Breitbart). Here, we report on the impact of journalistic best practices to label individual news articles for their credibility and political alignment. We found that while source level labels are decent proxies for political alignment labeling, they are very poor proxies-almost the same as flipping a coin-for credibility ratings. Next, we study the implications of such source level labeling on downstream processes such as the development of automated misinformation detection algorithms and political fairness audits therein. We find that the automated misinformation detection and fairness algorithms can be suitably revised to support their intended goals but might require different assumptions and methods than those which are appropriate using source level labeling. The results suggest caution in generalizing recent results on misinformation detection and political bias therein. On a positive note, this work shares a new dataset of journalistic quality individually labeled articles and an approach for misinformation detection and fairness audits.
KW - algorithmic fairness
KW - article level labeling
KW - misinformation detection
KW - political bias
UR - http://www.scopus.com/inward/record.url?scp=85159226537&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85159226537&partnerID=8YFLogxK
U2 - 10.1145/3578503.3583617
DO - 10.1145/3578503.3583617
M3 - Conference contribution
AN - SCOPUS:85159226537
T3 - ACM International Conference Proceeding Series
SP - 107
EP - 116
BT - WebSci 2023 - Proceedings of the 15th ACM Web Science Conference
PB - Association for Computing Machinery
Y2 - 30 April 2023 through 1 May 2023
ER -