Speech Portal: сентября 2013

четверг, 19 сентября 2013 г.

Speech Portal

Primetalk's Speech Portal is a platform for constructing sophisticated spoken dialog systems. It uses SynapseGrid as an integration platform and contains all the necessary components for a dialog system to speak with a human being:

Pipeline:

Speech signal processing.
Recognizer.
Parser.
Dialog manager.
Templater.
Synthesizer

Knowledge representation

A priori

Grammar expressions for input and output language.
Owl/rdf ontology import.

A posteriori

Strictly-typed frames.
Untyped frames.

Partial (at runtime) and uncertain

Fuzzy sets.
Incomplete frames.
Probability distributions.

Basic dialog logic building blocks

User input comprehension.
Context representation.
Output utterance proposals.

API for custom dialog logic

Extended Petry nets.
Composable functional speaking strategy.

API for real-time custom reactions.

Selected set of contacts (hooks) for getting all interesting real-time events.
Carefully designed replaceable parts of pipeline.

For recognizer part we support:

sphinx4 + our acoustic model or any voxforge models.
google speech api (unofficial).

For speech synthesis:

Festival + voice.
OpenMary + voice.
RhVoice + voice.
SpeechPro + voice.

Подписаться на: Сообщения (Atom)