Cognitive science research about superposition in voice: Possible future views of self and humanity based on quantum perspective

Daisuke Hayashi

doi:10.51094/jxiv.1293

Authors

Daisuke Hayashi, Japan Tobacco Inc., D-LAB (https://researchmap.jp/d_s_hayashi)

DOI:

https://doi.org/10.51094/jxiv.1293

Keywords:

voice, superposition, pop culture, voice engineering technology, view of self, view of humanity

Abstract

This paper explores the potential of cognitive science research into "superposition in voice," a concept inspired by quantum theory, to examine how a voice can simultaneously embody multiple identities or interpretations. It takes three perspectives (self, individuality, and humanity) and reviews research from multiple disciplines relating to each. It shows that advances in voice-related human skills in pop culture, such as those of voice actors and VTubers, and in engineering technologies, such as speech synthesis and voice conversion, are blurring the boundaries between "self and others," "speaker and character," and "human and machine." It suggests that these blurred boundaries can be viewed as "superposition in voice." From this point of view, the paper discusses questions relating to each of the three perspectives and proposes possible research approaches in cognitive science, such as quantum cognition and projection science. It also highlights the importance of a digital archive of voices in pop culture, as well as the ethical, legal, and social issues surrounding voice engineering. Overall, the paper advocates interdisciplinary research that bridges the gap between the humanities and engineering in order to redefine future views of the self and humanity.

Conflicts of Interest Disclosure

The author declares no potential conflicts of interest in this article.



