Open captions driven by REST API calls
On Linux operating systems Wowza installation ships with OpenJDK that does not include fonts. On Linux
you need to manually install fontmanager and at least one font openttf-dejavu to make open captions work.
To get the REST API up and running, you need to add the below HTTP provider to your VHost.xml
file:
<HTTPProvider>
<BaseClass>com.raskenlund.caption.HttpProviderCaptionApi</BaseClass>
<RequestFilters>captions*</RequestFilters>
<AuthenticationMethod>none</AuthenticationMethod>
</HTTPProvider>
You need to add ModuleLiveOpenCaptions
module to your Application.xml
<Modules>
...
<Module>
<Name>ModuleLiveOpenCaptions</Name>
<Description>ModuleLiveOpenCaptions</Description>
<Class>com.raskenlund.caption.ModuleLiveOpenCaptions</Class>
</Module>
</Modules>
You have to enable transcoder in the application via the Engine Manager UI or directly in the xml:
<Transcoder>
<!-- To turn on transcoder set to: transcoder -->
<LiveStreamTranscoder>transcoder</LiveStreamTranscoder>
You also need to set the raskenlund.804.license
application property:
<Property>
<Name>raskenlund.804.license</Name>
<Value>W804-1038-xxxx-xxxx-xxxx</Value>
<Type>String</Type>
</Property>
Assuming your application is called live
and your incoming stream is called myStream
, you can
trigger a test caption using curl:
curl --location --request POST "http://localhost:1935/captions/v1/applications/live/streams/myStream/caption"
\ --header "Content-Type: application/json" --data-raw "{
\"captions\": [
{
\"textData\": \"Hello World, this is the captioning module\",
\"language\": \"en\"
}
],
\"delay\": 0
\}"
CEA608 captions driven by REST API calls
To get the REST API up and running, you need to add the below HTTP provider to your VHost.xml
file:
<HTTPProvider>
<BaseClass>com.raskenlund.caption.HttpProviderCaptionApi</BaseClass>
<RequestFilters>captions*</RequestFilters>
<AuthenticationMethod>none</AuthenticationMethod>
</HTTPProvider>
You need to add ModuleOnTextDataToCEA608
module to your Application.xml
<Module>
<Name>ModuleOnTextDataToCEA608</Name>
<Description>ModuleOnTextDataToCEA608</Description>
<Class>com.raskenlund.caption.ModuleOnTextDataToCEA608</Class>
</Module>
You need to set the raskenlund.804.license
application property:
<Property>
<Name>raskenlund.804.license</Name>
<Value>W804-1038-xxxx-xxxx-xxxx</Value>
<Type>String</Type>
</Property>
You need to configure the below timed text settings in the application:
<TimedText>
<VODTimedTextProviders></VODTimedTextProviders>
<!-- Properties for TimedText -->
<Properties>
<Property>
<Name>captionLiveIngestType</Name>
<Value>onTextData</Value>
<Type>String</Type>
</Property>
<Property>
<Name>captionLiveIngestLanguages</Name>
<Value>eng,nld,nor,hun</Value>
<Type>String</Type>
</Property>
</Properties>
</TimedText>
WebVTT captions driven by REST API calls
Please follow the setup for the CEA608 captions, except that you should not add ModuleOnTextDataToCEA608
to the application, but configure the below timed text property instead:
<TimedText>
<VODTimedTextProviders></VODTimedTextProviders>
<!-- Properties for TimedText -->
<Properties>
<Property>
<Name>cupertinoLiveCaptionsUseWebVTT</Name>
<Value>true</Value>
<Type>Boolean</Type>
</Property>
Speech To Text with AWS Transcribe
You need to add ModuleLiveTranscribe
module to your Application.xml
<Module>
<Name>ModuleLiveTranscribe</Name>
<Description>ModuleLiveTranscribe</Description>
<Class>com.raskenlund.caption.ModuleLiveTranscribe</Class>
</Module>
And you must enable transcoder in the application via the Engine Manager UI or directly in the xml. If you
do not need video transcoding, you can use the “audioonly” template so that video transcoding is disabled:
<Transcoder>
<!-- To turn on transcoder set to: transcoder -->
<LiveStreamTranscoder>transcoder</LiveStreamTranscoder>
<Templates>audioonly.xml</Templates>
Please make sure that at least one Encode`
in your active transcoder profiles have the
audio codec set to something other than PassThru`
. If no audio is transcoded, the plugin
is not able to capture audio frames and send them to AWS, which will time out the
transcoding session after 15 seconds without audio data.