AITalk^® WEBAPI［AICloud］

Ideal for WEB services! Utilizes a easy yet high-quality
speech synthesis

WEB API is a service that can utilize high-quality speech synthesis engine AITalk^® in SaaS format. There is no need to develop and apply the server for speech synthesis on your own, you can easily start utilizing speech synthesis in various services such as web services, smart phone apps, and campaigns. The voices can also express emotions (*) that will replace your image of the dull speech synthesis.

※*Available for some voices.

Main usage

Voice interaction application,Cloud phone service,News read aloud application

Provision type

Cloud / API / SaaS

AITalk^®WebAPI is recommended for people who

Are looking to start a service using speech
synthesis while minimizing the cost for
development and applications.

There are no needs to develop or operate speech synthesis server as the operation will be done by us.
You are able to save resources for the application as you do not have to develop and apply on your own.

Are looking to deploy a service using
speech synthesis in multi-devices.

It takes tremendous effort to integrate a speech synthesis engine into Android, iOS, WindowsCE, Windows, Mac, and etc.
AITalk^® WebAPI enables easy support for multi-devices.

Characteristics of AITalk^®WebAPI

Ability of expressing emotions（^*）

We are able to achieve expression of emotions suitable for each situation and use.

^*Available for some voices.
Human like natural voices

AITalk^®WebAPI makes possible creation of natural, human-like voices unlike the previous robot like speech synthesis.
A variety of voices

We have a total of 17 speakers consisting of 15 standard speakers and 2 Kansai dialect speakers that can be used for various purposes.
Original custom voices are also available

You can use an original sound dictionary of voices of celebrities or voice actors generated by the original speech synthesis dictionary service “AITalk CustomVoice.”
We have many plans suitable from startups to a large-scale services

Our plans are available for services of various scales, starting from the Minimum Start Plan at 5,000 yen per month to a large scale plan specifically dedicated for certain environments.
Simple and easy-to-use API

You can smoothly introduce our simple API service that is simple enough to use without a speech synthesis expertise.
A provision format that doesn’t require a specific development language

The development language does not matter as long as the network communication (REST format data communication) is possible.
Equipped with a tuning function

The inntonations of Proper nouns and technical terms can be registered from the user dictionary page.

Speakers Introduction

Nozomi

Corresponding Expression of Emotion: Normal, joy, anger, sadness Her voice is pleasant and youthful. Her voice can be used for various situations such as for narrations, automatic telephone answering system, wireless-activated disaster warning system, entertainment, etc.

DNN
Unit-selection-based・with emotion

Normal
Joy
Anger
Sadness

Kaho

Her voice is extremely clear and easy to understand. Available for a wide range of use including automatic telephone answering (CTI, IVR) and narration for the making of animation.

DNN
Unit-selection-based・with emotion

Normal
Joy
Anger
Sadness

Yumiko

Mature and calm voice.

DNN
Unit-selection-based・with emotion

Normal
Joy
Anger
Sadness

Kanon

Sweet and cute voice.

DNN
Unit-selection-based・with emotion

Normal
Joy
Anger
Sadness

Tsubasa

Firm and honest voice.

DNN
Unit-selection-based・with emotion

Normal
Joy
Anger
Sadness

Akari

Her voice gives a cheerful and bright impression. Most suitable for the use of product guidance and promotions.

DNN
Unit-selection-based・with emotion

Normal
Joy
Anger
Sadness

Nanako

Features a very calming voice. Her voice is best suited for reading news and audio guidance.

DNN
Unit-selection-based・with emotion

Normal
Joy
Anger
Sadness

Shiori

Youthful and friendly voice.

DNN
Unit-selection-based・with emotion

Normal
Joy
Anger
Sadness

Seiji

His voice has a very sincere tone. Suitable for persuasion and calling attention.

DNN
Unit-selection-based・with emotion

Normal
Joy
Anger
Sadness

Osamu

His voice features high applicability. Applicable to various scenes.

DNN
Unit-selection-based・with emotion

Normal
Joy
Anger
Sadness

Taichi

Corresponding Expression of Emotion: Normal, joy His voice gives a youthful and unique impression. Most suitable for using in the field of entertainment.

DNN
Unit-selection-based・with emotion

Normal
Joy
Anger
Sadness

Kenta

A gentle, luminous and modest voice.

DNN
Unit-selection-based・with emotion

Normal
Joy
Anger
Sadness

Anzu

Features a very loving and earnest voice.

DNN
Unit-selection-based・with emotion

Normal
Joy
Anger
Sadness

Chihiro

A charming nasal voice.

DNN
Unit-selection-based・with emotion

Normal
Joy
Anger
Sadness

Koutarou

Features a slow-paced and cute voice.

DNN
Unit-selection-based・with emotion

Normal
Joy
Anger
Sadness

Yuuto

A brisk and intelligent sounding boy’s voice.

DNN
Unit-selection-based・with emotion

Normal
Joy
Anger
Sadness

Audio demonstration

Nozomi

Shiori

Seiji

Taichi

Kenta

Kaho

Yumiko

Unit-selection-based・
Without emotion

Kanon

Unit-selection-based・
Without emotion

Tsubasa

Unit-selection-based・
Without emotion

Akari

Unit-selection-based・
Without emotion

Nanako

Unit-selection-based・
Without emotion

Osamu

Unit-selection-based・
Without emotion

Anzu

Unit-selection-based・
Without emotion

Chihiro

Unit-selection-based・
Without emotion

Koutaro

Unit-selection-based・
Without emotion

Yuuto

Unit-selection-based・
Without emotion

Kansai Dialect

Miyabi

Unit-selection-based・
Without emotion

Kansai Dialect

Yamato

Unit-selection-based・
Without emotion

Speed

1
Pitch

1
Intonation

1

Anger

0
Sadness

0

Joy

0

Synthesize 合成中再生中 Stop

DNN音声辞書を利用する

* About the use of speech synthesis demontration

Secondary use of the speech synthesis demonstration provided on this website is prohibited.
In addition, use other than demonstration on this website is prohibited.
Also, please check the terms and conditions of this website.

Main Functions for AITalk^® WebAPI

Emotion adjustment function^*

※Speakers who correspond to emotion include Nozomi (Joy, Anger, and Sadness), Maki (Joy, Anger, and Sadness), Reina (Joy), Taichi (Joy) only.

Text to speech

Speed adjustment

It can adjust speed in the range between 0.5 – 4 times.

Intonation dictionary registration

※Paid option

Pitch adjustment function

It can adjust the pitch (tone of the voice) in the range between 0.5 – 2.0 times.

Word dictionary function

The user dictionary function registers and saves names of people and places that are read in special ways. You can register not only how to read but also the word intonation.

Volume adjustment

Voice selection

From children to adults, you can choose a voice from the total of 14 standard/Kansai-dialect speakers to suit each use case

※Word dictionary function is not available for Kansai-dialect speaker. We thank you for your understanding.

AITalk^® WebAPI Application Examples

Web campaign

– To plan unique and fun Web campaigns
– To carry out user-interactive campaigns using voices of celebrities and voice actors

＜If you use AITalk^® WebAPI×CustomVoice…＞

– You can use the voice of the user who is entering the text to read aloud the text input.

Read aloud news application

– To deliver a real-time reiteration service for up-to-date news without having to record.
– To develop an app in which users can listen to the news in a self-picked voice.

＜If you use AITalk^® WebAPI…＞

– It can reiterate the news in real-time without having a news presenter to read.
– You may choose from 4 men, 7 women, and 4 children, in total 15 speakers of your choice.

Voice dialogue

– To easily use speech synthesis while keeping the initial development and application cost low.
– To develop an app that makes possible conversations with various characters.

＜If you use AITalk^® WebAPI×CustomVoice…＞

– You can start a speech synthesis service while saving the initial development and application costs, such as server maintenance and monitoring costs.
– CustomVoice allows you to create an application that can converse in an original character voice.

Interactive Voice Response (IVR)

– To make AI automatically respond to variable information.
– To use speech synthesis in interactive voice response while saving the initial and application 　cost.

＜If you use AITalk® WebAPI…＞

– Responding to variable information in real-time becomes possible without having an operator to respond.
– You can affiliate with IVR to create speech synthesis while saving initial development and 　

Car navigation

– To navigate with voice more information other than the destination names.
–To start small for testing to see how much needs exist.

＜If you use AITalk® …＞

– Unrecorded information can be reiterated with voice.
– You can start from 5,000yen per month, therefore makes possible experimental use of the service.

Electronic books

– To add value to books by vocalizing the contents.
– To choose a voice suitable to read aloud the book.

＜If you use AITalk^® WebAPI…＞

– You can easily build a system in which an instant input of a book’s text information becomes vocalized.
– You can choose a voice most suitable for the situation from 4 men, 7 women, 2 boys, and 2 girls (all in Japanese).

Steps before use

Flow of introducing easy to use text to speech AITalk Koenoshokunin Cloud version

Step.1
Inquiry Form (10 days before use)

Please download the application form, terms and conditions, and account application form from the following and check the contents.

Download terms and conditions・application form・account application form.

If you agree to the terms and conditions, please fill out the application form and account application form and apply from form.

Step.2
Contact from AI

We will have a person in charge contact you back within two business days.

^*Please understand that depending on the results of our company examination we will not be able to provide you our service.

Step.3
Providing the ID and PW

We will issue you the ID and PW by email.

Step.4
Start of usage

You are able to start, based on the contents of your application plan. Please contact us for any further questions.

AITalk® WEBAPI［AICloud］

AITalk^® WEBAPI［AICloud］