Graduation Thesis: Design of Speech Synthesis Software

2022-01-13 20:51:39

Total thesis length: 21,651 characters

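Abstract

With the rapid advance of technology, humanity is about to enter the age of artificial intelligence. Speech recognition and speech synthesis, as prominent topics in artificial intelligence, have drawn the attention of many researchers. Speech synthesis is a foundational technology for human-computer interaction: in the age of artificial intelligence, converting a machine's text into speech data and playing it back is the final link in that interaction.

This thesis first outlines the research background and current state of speech synthesis technology. It then briefly introduces the specific methods used to implement speech synthesis, along with their advantages and disadvantages, and describes the interface functions of the Alibaba Cloud speech synthesis service used in this design. Next, it briefly introduces the Java language and the basic functions of the Android Studio development tool used in this design. Finally, it describes how these tools were used to write a program that implements text-to-speech conversion, audio playback, audio format conversion, and audio saving, explains the program's implementation flow and operating procedure in detail, and completes the design of a speech synthesis app.

Keywords: speech synthesis; Android; Java; audio development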

Design of Speech Synthesis Software

ABSTRACT

With the rapid development of science and technology, humanity is about to enter the era of artificial intelligence^[1]. Speech recognition and speech synthesis, as hot topics in artificial intelligence, have attracted the attention of many researchers. Speech synthesis is a basic technology for human-computer interaction. In the era of artificial intelligence, converting a machine's text into speech data for playback is the final link of human-computer interaction.

This thesis first summarizes the research background and current state of speech synthesis technology^[2]. It then briefly introduces the specific methods of speech synthesis with their advantages and disadvantages, and introduces the interface functions of the Alibaba Cloud (Aliyun) speech synthesis service used in this design. Next, it briefly introduces the Java language used in this design and the basic functions of Android Studio. Finally, these tools are used to write a program that implements text-to-speech conversion, audio playback, audio format conversion, and audio saving. The implementation flow and operating procedure are explained in detail, and the design of the speech synthesis app is completed.

Key Words: speech synthesis, Android, Java, audio development

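Abstract

ABSTRACT

Chapter 1 Introduction

1.1 Background

1.2 Overview of Speech Synthesis Algorithms

1.3 Main Work and Workflow of This Thesis

1.3.1 Program Architecture and Flow of This Design

1.3.2 Structure of This Thesis

Chapter 2 Speech Synthesis Algorithms and Implementation

2.1 Principles of Speech Synthesis

2.1.1 Text and Prosody Analysis

2.1.2 Acoustic Model

2.1.3 Vocoder

2.2 Common Speech Synthesis Algorithms

2.2.1 Formant Synthesis

2.2.2 Articulatory Rule-Based Synthesis

2.2.3 Waveform Concatenation Synthesis

2.2.4 Unit Selection Synthesis

2.2.5 Hidden Markov Model-Based Synthesis

2.2.6 Deep Neural Network Synthesis

2.3 Speech Synthesis Interaction Flow Based on the Alibaba Cloud Speech Synthesis API

2.3.1 Introduction to the Alibaba Cloud Speech Synthesis Interface

2.3.2 Alibaba Cloud Speech Synthesis Interaction Flow

2.4 Chapter Summary

Chapter 3 App Development Platform and Tools

3.1 Introduction to Android Studio

3.1.1 Advantages of Android Studio

3.1.2 The Spinner Widget

3.1.3 The SeekBar Widget

3.1.4 The Button Widget

3.1.5 The EditText Widget

3.1.6 The TextView Widget

3.1.7 Layouts

3.1.8 Activities and Events

3.1.9 Common Audio Formats

3.1.10 File Storage in Android

3.1.11 Media Players in Android

3.2 The Java Programming Language

3.2.1 Advantages of Java

3.2.2 The Java Programming Environment

3.3 Chapter Summary

Chapter 4 Implementation

4.1 Feasibility Analysis

4.2 Requirements and Functional Structure Analysis

4.2.1 System Goals

4.2.2 System Functional Structure

4.3 Setting Up the Development Environment

4.4 User Interface Design

4.5 Activity Implementation

4.5.1 Obtaining the Text to Synthesize

4.5.2 Obtaining the Selected Parameters

4.5.3 Calling the Speech Synthesis Interface and Its Callback Methods

4.5.4 Playing Audio

4.5.5 Converting the Audio Format and Saving

4.6 Chapter Summary

Chapter 5 Conclusions and Outlook

5.1 Summary of Work

5.2 Prospects and Outlook

References

Acknowledgments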

Chapter 1 Introduction

1.1 Background

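Modern society has entered the digital information age. Network and multimedia technologies have developed rapidly, and interaction between computers and people grows more frequent by the day. How to give computers human-like abilities to listen and speak has been a research focus of the information industry since the 1990s. Building a computer speech system that can listen and speak requires two key technologies: speech recognition and speech synthesis^[1]. Speech synthesis, also known as text-to-speech (TTS), converts arbitrary text into standard, fluent speech that is read aloud. It can ease the difficulties of human-computer interaction and make communication between people and computers more convenient and efficient^[3]. Over recent decades, research on speech synthesis in China and abroad has focused mainly on rule-based text-to-speech conversion, that is, converting written language into spoken language^[4].

It is foreseeable that advances in speech synthesis will bring substantial change and development to society. The progress and spread of the technology will not only help Chinese speech applications grow and spread quickly, but also drive the popularization and socialization of information. Applying Chinese speech synthesis in the computer field can bring the operation of computers closer to everyday life and make it more user-friendly, helping ordinary people use electronic products more easily. Its convenience and broad applicability can inject new momentum into society and promote the birth and growth of new products, which in turn promises considerable economic benefits.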
