AITalk® WEBAPI[AICloud]

Ideal for WEB services! Utilizes a easy yet high-quality
speech synthesis

AITalk® WEBAPI[AICloud]

WEB API is a service that can utilize high-quality speech synthesis engine AITalk® in SaaS format. There is no need to develop and apply the server for speech synthesis on your own, you can easily start utilizing speech synthesis in various services such as web services, smart phone apps, and campaigns. The voices can also express emotions (*) that will replace your image of the dull speech synthesis.

※*Available for some voices.

AITalk® WEBAPI[AICloud]

Main usage

Voice interaction application,Cloud phone service,News read aloud application

Provision type

Cloud / API / SaaS

AITalk®WebAPI is recommended for people who

Are looking to start a service using speech
synthesis while minimizing the cost for
development and applications.

There are no needs to develop or operate speech synthesis server as the operation will be done by us.
You are able to save resources for the application as you do not have to develop and apply on your own.

Are looking to deploy a service using
speech synthesis in multi-devices.

It takes tremendous effort to integrate a speech synthesis engine into Android, iOS, WindowsCE, Windows, Mac, and etc.
AITalk® WebAPI enables easy support for multi-devices.

Characteristics of AITalk®WebAPI

  • Ability of expressing emotions(*

    We are able to achieve expression of emotions suitable for each situation and use.

    *Available for some voices.

  • Human like natural voices

    AITalk®WebAPI makes possible creation of natural, human-like voices unlike the previous robot like speech synthesis.

  • A variety of voices

    We have a total of 17 speakers consisting of 15 standard speakers and 2 Kansai dialect speakers that can be used for various purposes.

  • Original custom voices are also available

    You can use an original sound dictionary of voices of celebrities or voice actors generated by the original speech synthesis dictionary service “AITalk CustomVoice.”

  • We have many plans suitable from startups to a large-scale services

    Our plans are available for services of various scales, starting from the Minimum Start Plan at 5,000 yen per month to a large scale plan specifically dedicated for certain environments.

  • Simple and easy-to-use API

    You can smoothly introduce our simple API service that is simple enough to use without a speech synthesis expertise.

  • A provision format that doesn’t require a specific development language

    The development language does not matter as long as the network communication (REST format data communication) is possible.

  • Equipped with a tuning function

    The inntonations of Proper nouns and technical terms can be registered from the user dictionary page.

Speakers Introduction

Nozomi

Nozomi

Corresponding Expression of Emotion: Normal, joy, anger, sadness Her voice is pleasant and youthful. Her voice can be used for various situations such as for narrations, automatic telephone answering system, wireless-activated disaster warning system, entertainment, etc.

  • DNN
  • Unit-selection-based・with emotion
  • Normal
  • Joy
  • Anger
  • Sadness
Kaho

Kaho

Her voice is extremely clear and easy to understand. Available for a wide range of use including automatic telephone answering (CTI, IVR) and narration for the making of animation.

  • DNN
  • Unit-selection-based・with emotion
  • Normal
  • Joy
  • Anger
  • Sadness
Yumiko

Yumiko

Mature and calm voice.

  • DNN
  • Unit-selection-based・with emotion
  • Normal
  • Joy
  • Anger
  • Sadness
Kanon

Kanon

Sweet and cute voice.

  • DNN
  • Unit-selection-based・with emotion
  • Normal
  • Joy
  • Anger
  • Sadness
Tsubasa

Tsubasa

Firm and honest voice.

  • DNN
  • Unit-selection-based・with emotion
  • Normal
  • Joy
  • Anger
  • Sadness
Akari

Akari

Her voice gives a cheerful and bright impression. Most suitable for the use of product guidance and promotions.

  • DNN
  • Unit-selection-based・with emotion
  • Normal
  • Joy
  • Anger
  • Sadness
Nanako

Nanako

Features a very calming voice. Her voice is best suited for reading news and audio guidance.

  • DNN
  • Unit-selection-based・with emotion
  • Normal
  • Joy
  • Anger
  • Sadness
Shiori

Shiori

Youthful and friendly voice.

  • DNN
  • Unit-selection-based・with emotion
  • Normal
  • Joy
  • Anger
  • Sadness
Seiji

Seiji

His voice has a very sincere tone. Suitable for persuasion and calling attention.

  • DNN
  • Unit-selection-based・with emotion
  • Normal
  • Joy
  • Anger
  • Sadness
Osamu

Osamu

His voice features high applicability. Applicable to various scenes.

  • DNN
  • Unit-selection-based・with emotion
  • Normal
  • Joy
  • Anger
  • Sadness
Taichi

Taichi

Corresponding Expression of Emotion: Normal, joy His voice gives a youthful and unique impression. Most suitable for using in the field of entertainment.

  • DNN
  • Unit-selection-based・with emotion
  • Normal
  • Joy
  • Anger
  • Sadness
Kenta

Kenta

A gentle, luminous and modest voice.

  • DNN
  • Unit-selection-based・with emotion
  • Normal
  • Joy
  • Anger
  • Sadness
Anzu

Anzu

Features a very loving and earnest voice.

  • DNN
  • Unit-selection-based・with emotion
  • Normal
  • Joy
  • Anger
  • Sadness
Chihiro

Chihiro

A charming nasal voice.

  • DNN
  • Unit-selection-based・with emotion
  • Normal
  • Joy
  • Anger
  • Sadness
Koutarou

Koutarou

Features a slow-paced and cute voice.

  • DNN
  • Unit-selection-based・with emotion
  • Normal
  • Joy
  • Anger
  • Sadness
Yuuto

Yuuto

A brisk and intelligent sounding boy’s voice.

  • DNN
  • Unit-selection-based・with emotion
  • Normal
  • Joy
  • Anger
  • Sadness

Audio demonstration

  • Speed

    1

  • Pitch

    1

  • Intonation

    1

  • Anger

    0

  • Sadness

    0

  • Joy

    0

Synthesize 合成中 再生中 Stop

* About the use of speech synthesis demontration

Secondary use of the speech synthesis demonstration provided on this website is prohibited.
In addition, use other than demonstration on this website is prohibited.
Also, please check the terms and conditions of this website.

Main Functions for AITalk® WebAPI

Emotion adjustment function *

Emotion adjustment function*

※Speakers who correspond to emotion include Nozomi (Joy, Anger, and Sadness), Maki (Joy, Anger, and Sadness), Reina (Joy), Taichi (Joy) only.

Text to speech

Text to speech

Speed adjustment

Speed adjustment

It can adjust speed in the range between 0.5 – 4 times.

Intonation dictionary registration

Intonation dictionary registration

※Paid option

Pitch adjustment function

Pitch adjustment function

It can adjust the pitch (tone of the voice) in the range between 0.5 – 2.0 times.

Word dictionary function

Word dictionary function

The user dictionary function registers and saves names of people and places that are read in special ways. You can register not only how to read but also the word intonation.

Volume adjustment

Volume adjustment

Voice selection

Voice selection

From children to adults, you can choose a voice from the total of 14 standard/Kansai-dialect speakers to suit each use case

※Word dictionary function is not available for Kansai-dialect speaker. We thank you for your understanding.

AITalk® WebAPI Application Examples

Web campaign

– To plan unique and fun Web campaigns
– To carry out user-interactive campaigns using voices of celebrities and voice actors

<If you use AITalk® WebAPI×CustomVoice…>

– You can use the voice of the user who is entering the text to read aloud the text input.

Web campaign

Read aloud news application

– To deliver a real-time reiteration service for up-to-date news without having to record.
– To develop an app in which users can listen to the news in a self-picked voice.

<If you use AITalk® WebAPI…>

– It can reiterate the news in real-time without having a news presenter to read.
– You may choose from 4 men, 7 women, and 4 children, in total 15 speakers of your choice.

Read aloud news application

Voice dialogue

– To easily use speech synthesis while keeping the initial development and application cost low.
– To develop an app that makes possible conversations with various characters.

<If you use AITalk® WebAPI×CustomVoice…>

– You can start a speech synthesis service while saving the initial development and application costs, such as server maintenance and monitoring costs.
– CustomVoice allows you to create an application that can converse in an original character voice.

Voice dialogue

Interactive Voice Response (IVR)

– To make AI automatically respond to variable information.
– To use speech synthesis in interactive voice response while saving the initial and application  cost.

<If you use AITalk® WebAPI…>

– Responding to variable information in real-time becomes possible without having an operator to respond.
– You can affiliate with IVR to create speech synthesis while saving initial development and  

Interactive Voice Response (IVR)

Car navigation

– To navigate with voice more information other than the destination names.
–To start small for testing to see how much needs exist.

<If you use AITalk® …>

– Unrecorded information can be reiterated with voice.
– You can start from 5,000yen per month, therefore makes possible experimental use of the service.

Car navigation

Electronic books

– To add value to books by vocalizing the contents.
– To choose a voice suitable to read aloud the book.

<If you use AITalk® WebAPI…>

– You can easily build a system in which an instant input of a book’s text information becomes vocalized.
– You can choose a voice most suitable for the situation from 4 men, 7 women, 2 boys, and 2 girls (all in Japanese).

Electronic books

Steps before use

Flow of introducing easy to use text to speech AITalk Koenoshokunin Cloud version

Step.1
Inquiry Form (10 days before use)

Please download the application form, terms and conditions, and account application form from the following and check the contents.

Download terms and conditions・application form・account application form.

If you agree to the terms and conditions, please fill out the application form and account application form and apply from form.

Step.2
Contact from AI

We will have a person in charge contact you back within two business days.

*Please understand that depending on the results of our company examination we will not be able to provide you our service.

Step.3
Providing the ID and PW

We will issue you the ID and PW by email.

Step.4
Start of usage

You are able to start, based on the contents of your application plan. Please contact us for any further questions.