Skip to content

Latest commit

 

History

History
 
 

Folders and files

NameName
Last commit message
Last commit date

parent directory

..
 
 
 
 
 
 
 
 
 
 
 
 

README.md

Google Cloud Platform logo

Google Cloud Speech API Node.js Samples

Build

The Cloud Speech API enables easy integration of Google speech recognition technologies into developer applications. Send audio and receive a text transcription from the Cloud Speech API service.

Table of Contents

Setup

  1. Read Prerequisites and How to run a sample first.

  2. Install dependencies:

    With npm:

    npm install
    

    With yarn:

    yarn install
    

Samples

Speech Recognition

View the documentation or the source code.

Usage: node recognize.js --help

Commands:
  sync <filename>                 Detects speech in a local audio file.
  sync-gcs <gcsUri>               Detects speech in an audio file located in a Google Cloud Storage bucket.
  sync-words <filename>           Detects speech in a local audio file with word time offset.
  sync-punctuation <filename>     Detects speech in a local audio file with automatic punctuation enabled.
  sync-video <filename>           Detects speech in a local audio file with original media type set to VIDEO.
  sync-metadata <filename>        Detects speech in a local audio file with audio metadata paramters.
  async <filename>                Creates a job to detect speech in a local audio file, and waits for the job to
                                  complete.
  async-gcs <gcsUri>              Creates a job to detect speech in an audio file located in a Google Cloud Storage
                                  bucket, and waits for the job to complete.
  async-gcs-words <gcsUri>        Creates a job to detect speech with word time offset in an audio file located in a
                                  Google Cloud Storage bucket, and waits for the job to complete.
  async-gcs-punctuation <gcsUri>  Creates a job to detect speech automatic punctuation enabled in an audio file located
                                  in a Google Cloud Storage bucket, and waits for the job to complete.
  async-gcs-video <gcsUri>        Creates a job to detect speech with original media type set to VIDEO in an audio file
                                  located in a Google Cloud Storage bucket, and waits for the job to complete.
  async-gcs-metadata <gcsUri>     Creates a job to detect speech with audio metadata paramaters in an audio file located
                                  in a Google Cloud Storage bucket, and waits for the job to complete.
  stream <filename>               Detects speech in a local audio file by streaming it to the Speech API.
  listen                          Detects speech in a microphone input stream. This command requires that you have SoX
                                  installed and available in your $PATH. See
                                  https://www.npmjs.com/package/node-record-lpcm16#dependencies

Options:
  --help                 Show help                                                                             [boolean]
  --encoding, -e                                                                          [string] [default: "LINEAR16"]
  --sampleRateHertz, -r                                                                        [number] [default: 16000]
  --languageCode, -l                                                                         [string] [default: "en-US"]

Examples:
  node recognize.js sync ./resources/audio.raw -e LINEAR16 -r 16000
  node recognize.js async-gcs gs://gcs-test-data/vr.flac -e FLAC -r 16000
  node recognize.js stream ./resources/audio.raw  -e LINEAR16 -r 16000
  node recognize.js listen

For more information, see https://cloud.google.com/speech/docs

Running the tests

  1. Set the GCLOUD_PROJECT and GOOGLE_APPLICATION_CREDENTIALS environment variables.

  2. Run the tests:

    With npm:

    npm test
    

    With yarn:

    yarn test