探討達成不同貼文類型適用的圖像替代文字描述方式研究/A study of the description of alt text suitable for different types of posts.

蔡涵如 Han-Ju Tsai, 何俊亨 Chun-Heng Ho

摘要


近年來社群軟體的使用率提高,圖像、影像逐漸取代文字為主要的社交物件。視障者也是社群軟體的高度使用者,當視障者在使用社群軟體時,會為沒有替代文字的圖片感到困擾。臉書近年推出自動替代文字系統,該系統仍然有許多可改善的空間。本研究探討另一項長期做為視障者提供視覺資訊的口述影像服務後,提出臉書貼文中的圖像替代文字應該因應貼文類型調整敘述方式。研究發現,由於相繼型的貼文在文字內容中,解釋圖像資訊的比例較少,因此貼文適合使用特定的圖像替代文字。而說明型貼文的文字內容中,已解釋許多圖像資訊,因此可以使用特定或抽象的圖像替代文字。以研究結果彙整成建議圖像替代文字產出流程圖,以提供未來相關研究或者相關服務參考。


Recently, images and videos have replaced text as the main social objects on social software. When the visually impaired use social software, they will be troubled by pictures without alt text. Facebook has launched an automatic alt text system, which still has a lot of room for improvement. After exploring the Audio Description service that provides visual information for the visually impaired, this research proposes that the alt text of posts should be adjusted according to the types of posts. This research found that since successive posts spend a smaller proportion of the text explaining the image information, the posts are suitable for specific alt text. The text of the illustrated post contains a lot of image information, so these posts are appropriate for specific or abstract alt text. The research results are compiled into a flowchart of the different ways of producing alt text provides a reference for future related research or the development of related services.



關鍵詞


替代文字; 口述影像; 螢幕翻譯; 可用性; 圖像索引

全文:

PDF

參考文獻


Armitage, L. H., & Enser, P. G. (1997). Analysis of user need in image archives. Journal of information science, 23(4), 287-299.

Barthes, R. (1993). Rhetoric of the Image: na.

BBC. (2020). Retrieved from https://www.bbc.com/news

Caro, M. R. (2016). Testing audio narration: the emotional impact of language in audio description. Perspectives, 24(4), 606-634. doi:10.1080/0907676X.2015.1120760

Di Giovanni, E. (2014). Visual and narrative priorities of the blind and non-blind: Eye tracking and audio description. Perspectives, 22(1), 136-153.

Facebook. (2016a). Learning to Segment. Retrieved from https://research.fb.com/learning-to-segment/

Facebook. (2016b). Under the hood: Building accessibility tools for the visually impaired on Facebook. Retrieved from https://code.fb.com/ios/under-the-hood-building-accessibility-tools-for-the-visually-impaired-on-facebook/

Facebook (2017). Accessibility Research: Developing automatic-alt text for Facebook screen reader users. Retrieved from https://research.fb.com/blog/2017/02/accessibility-research-developing-automatic-alt-text-for-facebook-screen-reader-users/

Facebook. (2018). Using AI to help people with visual impairments share images on Facebook. Retrieved from https://research.fb.com/using-ai-to-help-people-with-visual-impairments-share-images-on-facebook/

Facebook. (2019). 自動替代文字如何運作?. Retrieved from https://www.facebook.com/help/216219865403298?__tn__=-UK-R

Facebook (Producer). (2021). How Facebook is using AI to improve photo descriptions for people who are blind or visually impaired. Retrieved from https://ai.facebook.com/blog/how-facebook-is-using-ai-to-improve-photo-descriptions-for-people-who-are-blind-or-visually-impaired/

Hollink, L., Schreiber, A. T., Wielinga, B. J., & Worring, M. (2004). Classification of user image descriptions. International Journal of Human-Computer Studies, 61(5), 601-626. doi:https://doi.org/10.1016/j.ijhcs.2004.03.002

Jaimes, A., & Chang, S.-F. (1999). Conceptual framework for indexing visual information at multiple levels. Paper presented at the Internet Imaging.

Jörgensen, C., Jaimes, A., Benitez, A. B., & Chang, S.-F. (2001). A conceptual framework and empirical research for classifying visual descriptors. Journal of the American Society for Information Science and Technology, 52(11), 938-947. doi:10.1002/asi.1161

Kemp, S. (2019). DIGITAL 2019: GLOBAL DIGITAL OVERVIEW. Retrieved from https://datareportal.com/reports/digital-2019-global-digital-overview

Kress, G. R., & Van Leeuwen, T. (1996). Reading images: The grammar of visual design: Psychology Press.

Kruger, J.-L. (2010). Audio narration: re-narrativising film. Perspectives, 18(3), 231-249. doi:10.1080/0907676X.2010.485686

Lazar, J., Allen, A., Kleinman, J., & Malarkey, C. (2007). What Frustrates Screen Reader Users on the Web: A Study of 100 Blind Users. International Journal of Human–Computer Interaction, 22(3), 247-269. doi:10.1080/10447310709336964

Morris, M. R., Zolyomi, A., Yao, C., Bahram, S., Bigham, J. P., & Kane, S. K. (2016). With most of it being pictures now, I rarely use it: Understanding Twitter's Evolving Accessibility to Blind Users. Paper presented at the Proceedings of the 2016 CHI Conference on Human Factors in Computing Systems.

Shatford, S. (1986). Analyzing the subject of a picture: a theoretical approach. Cataloging & classification quarterly, 6(3), 39-62.

Times, T. N. Y. (2020). Retrieved from https://www.nytimes.com/

Voykinska, V., Azenkot, S., Wu, S., & Leshed, G. (2016). How blind people interact with visual content on social networking services. Paper presented at the Proceedings of the 19th ACM Conference on Computer-Supported Cooperative Work & Social Computing.

Walczak, A., & Fryer, L. (2017). Creative description: The impact of audio description style on presence in visually impaired audiences. British Journal of Visual Impairment, 35(1), 6-17.

Wu, S., & Adamic, L. A. (2014). Visually impaired users on an online social network. Paper presented at the Proceedings of the SIGCHI Conference on Human Factors in Computing Systems.

Wu, S., Wieland, J., Farivar, O., & Schiller, J. (2017). Automatic alt-text: Computer-generated image descriptions for blind users on a social network service. Paper presented at the Proceedings of the 2017 ACM Conference on Computer Supported Cooperative Work and Social Computing.

口述影像服務網. (2019). 什麼是口述影像 (Audio Description)?. Retrieved from https://www.ad.org.tw/%e4%bb%80%e9%ba%bc%e6%98%af%e5%8f%a3%e8%bf%b0%e5%bd%b1%e5%83%8f-audio-description%ef%bc%89%ef%bc%9f/

翁煌德. (2019). 口述影像:給視障者、也給所有人的全人可能性. Retrieved from https://hef.org.tw/journal365-3/

臧國仁、蔡琰. (2017). 敘事傳播:故事/人文觀點.

趙雅麗. (2002). 言語世界中的流動光影:口述影像的理論建構.