Define an API once in Babel. Implement or use an existing code generator to map the API definition into usable objects and functions in any programming language.
Being an API designer is tough. There are innumerable protocols and serialization formats that two hosts can use to communicate. Today, JSON over HTTP is gaining popularity, but just a few years ago, XML was the standard. To compound the issue, developers need to support an increasing number of language-specific SDKs to gain wide adoption.
Babel seeks to:
- Define API endpoints in terms of input and output data types that can be consistently implemented in different protocols and languages.
- Use structs (product types) and tagged unions (sum types) as fundamental data types for modeling APIs flexibly, but strictly.
- Improve the visibility teams have into their APIs by centralizing specifications and documentation.
If we had only one protocol and one language, BabelAPI wouldn't be needed, but unfortunately humanity was handicapped for good reason. See "Why do we have multiple programming languages?" at the end of this document.
Babel makes no assumptions about the protocol layer being used to make API requests and return responses; its first use case is the Dropbox v2 API, which operates over HTTP. Babel neither comes with nor enforces any particular RPC framework.
Babel makes some assumptions about the data types supported in the serialization format and target programming language. It's assumed that there is a capacity for representing dictionaries (unordered String Keys -> Value), lists, numeric types, and strings. The intention is for Babel to map to a multitude of serialization formats from JSON to Protocol Buffers.
Babel assumes that an operation (or API endpoint) can have its request and response types defined without relation to each other. In other words, the type of response does not change based on the input to the endpoint. An exception to this rule is afforded for error responses.
Download or clone BabelAPI, and run the following in its root directory:
$ sudo python setup.py install
This will install a script babelapi to your PATH that can be run from the
command line:
$ babelapi -h
If you did not run setup.py but have the Python package in your PYTHONPATH,
you can replace babelapi with python -m babelapi.cli as follows:
$ python -m babelapi.cli -h
You can compile the example Babel specs and apply them to a documentation template:
$ babelapi example/api/v2_files.babel example/api/v2_users.babel example/template/docs
You can view the generated documentation using:
$ google-chrome example/template/docs/docs.html
There are three types of files.
Specifications define the data types and operations available in an API.
Headers define only data types available in an API. Headers can be included in spec files so that common data types can be re-used.
Code generators are Python modules that implement the abstract
babelapi.generator.generator.CodeGenerator class. BabelAPI automatically
detects subclasses and calls the generate() method. The code generator
has access to a self.api member variable which represents the spec as a
Python object.
A spec is composed of a namespace followed by zero or more includes and zero or more definitions:
Spec ::= Namespace Include* Definition*
Specs must begin with a namespace declaration:
Namespace ::= 'namespace' Identifier
Example:
namespace users
This is the namespace for all the operations and data types in the Spec file. It helps us separate different parts of the API like "files", "users", and "photos".
Use an include to make all definitions in a Header available:
Include ::= 'include' Identifier
Example:
include common
This will search for a file called common.babelh in the same directory
as the Spec.
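The lookup rule above can be sketched in a few lines of Python. This is only an illustration of "same directory, .babelh extension"; `resolve_include` is a hypothetical helper, not a function provided by BabelAPI, and the spec path used in the example is made up.

```python
import os

# Extension named in the text above for header files.
HEADER_EXTENSION = '.babelh'


def resolve_include(spec_path, include_name):
    """Return the header path an include would refer to.

    Hypothetical helper: the header is looked up next to the spec file.
    """
    spec_dir = os.path.dirname(os.path.abspath(spec_path))
    return os.path.join(spec_dir, include_name + HEADER_EXTENSION)


# "include common" inside a spec at example/api/users.babel (assumed path)
print(resolve_include('example/api/users.babel', 'common'))
```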
There are four types of definitions available:
Definition ::= Alias | Struct | Union | Operation
A struct is a type made up of other types:
struct Space
    doc:
        The space quota info for a user.

    quota UInt64:
        The user's total quota allocation (bytes).
    private UInt64:
        The user's used quota outside of shared folders (bytes).
    shared UInt64:
        The user's used quota in shared folders (bytes).

    example default
        quota=1000000
        private=1000
        shared=500
A struct can optionally define a documentation string by declaring doc:.
The colon enters documentation mode and indicates that the following
text is free form. Documentation mode is terminated only by a line that has the
same indentation as the original "doc:" string.
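To make the termination rule concrete, here is a minimal sketch of how a reader of a spec could collect the text of a doc block. It is not BabelAPI's parser: `extract_doc` is a hypothetical function, it assumes indentation is made of spaces, and it also stops at lines indented less than "doc:", which the text above does not spell out.

```python
def extract_doc(lines):
    """Return the free-form text of the first doc: block in lines."""
    doc_indent = None
    collected = []
    for line in lines:
        stripped = line.strip()
        indent = len(line) - len(line.lstrip(' '))
        if doc_indent is None:
            # Still looking for the line that enters documentation mode.
            if stripped == 'doc:':
                doc_indent = indent
            continue
        if stripped and indent <= doc_indent:
            # Back at the column of "doc:" (or left of it): the block is over.
            break
        if stripped:
            collected.append(stripped)
    return ' '.join(collected)


spec = [
    'struct Space',
    '    doc:',
    '        The space quota info',
    '        for a user.',
    '',
    '    quota UInt64:',
    "        The user's total quota allocation (bytes).",
]
print(extract_doc(spec))
```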
After the documentation is a list of fields. Fields are formatted with the field
name first followed by the field type. To provide documentation for a field, use
: again; otherwise, end the line with the field type.
Finally, examples can be declared. An example is declared by using the example
keyword followed by a label, and optionally text. By convention, "default" should
be used as the label name for an example that can be considered a good
representation of the general case for the type.
Types can also be composed of other types:
struct Team
    doc:
        Information relevant to a team.

    name String:
        The name of the team.

    example default
        name="Acme, Inc."

struct AccountInfo
    doc:
        Information for a user's account.

    display_name String:
        The full name of a user.
    space Space:
        The user's quota.
    is_paired Boolean:
        Whether the user has a personal and business account.
    team Team|Null:
        If this paired account is a member of a team.

    example default "Paired account"
        display_name="Jon Snow"
        is_paired=true

    example unpaired "Unpaired account"
        display_name="Jon Snow"
        is_paired=false
        team=null
Note in the preceding example that the AccountInfo.team field can be a
Team struct or Null. By default, fields do not accept null as a
valid value unless explicitly indicated.
A struct can also inherit from another struct using the extends keyword:
struct EntryInfo
    doc:
        A file or folder entry.

    id String(max_length=40):
        A unique identifier for the file.
    path String:
        Path to file or folder.
    modified DbxTimestamp|Null:
        The last time the file was modified on Dropbox, in the standard date
        format (null for root folder).
    is_deleted Boolean:
        Whether the given entry is deleted.

struct FileInfo extends EntryInfo
    doc:
        Describes a file.

    size UInt64:
        File size in bytes.
    mime_type String|Null:
        The Internet media type determined by the file extension.
    media_info MediaInfo optional:
        Information specific to photo and video media.

    example default
        id="xyz123"
        path="/Photos/flower.jpg"
        size=1234
        mime_type="image/jpg"
        modified="Sat, 28 Jun 2014 18:23:21"
        is_deleted=false
Note in the preceding example the use of the optional keyword which denotes
that the field may not be present. We do not conflate the optionality of a field
with the nullability of a field's data_type. However, these concepts may be
intentionally conflated in languages that don't maintain a strict difference.
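The difference between an optional field and a nullable field is easy to see on a decoded dictionary. The following is a rough sketch under the assumption that a message has been decoded into a Python dict; `check_field` is a hypothetical helper, not part of BabelAPI.

```python
# Sentinel meaning "the key is not present at all", as opposed to None.
_MISSING = object()


def check_field(data, name, nullable=False, optional=False):
    """Classify one field of a decoded dictionary (hypothetical helper)."""
    value = data.get(name, _MISSING)
    if value is _MISSING:
        # Optionality: may the key be left out entirely?
        if optional:
            return 'absent'
        raise ValueError('missing required field: ' + name)
    if value is None:
        # Nullability: may the key be present with a null value?
        if nullable:
            return 'null'
        raise ValueError('field is not nullable: ' + name)
    return 'present'


file_info = {'size': 1234, 'mime_type': None}
print(check_field(file_info, 'mime_type', nullable=True))   # null
print(check_field(file_info, 'media_info', optional=True))  # absent
print(check_field(file_info, 'size'))                       # present
```

A language without a distinct "absent" state could map both cases to the same value, which is the intentional conflation mentioned above.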
The setting of default values for fields is best seen in the context of operations. Please see the account_id default value example in the description of operations below.
A union in Babel is a tagged union. In its field declarations, a tag name is followed by a data type:
struct PhotoInfo
    doc:
        Photo-specific information derived from EXIF data.

    time_taken DbxTimestamp:
        When the photo was taken.
    lat_long List(data_type=Float32)|Null:
        The GPS coordinates where the photo was taken.

    example default
        time_taken="Sat, 28 Jun 2014 18:23:21"
        lat_long=null

struct VideoInfo
    doc:
        Video-specific information derived from EXIF data.

    time_taken DbxTimestamp:
        When the video was taken.
    lat_long List(data_type=Float32)|Null:
        The GPS coordinates where the video was taken.
    duration Float32:
        Length of video in milliseconds.

    example default
        time_taken="Sat, 28 Jun 2014 18:23:21"
        lat_long=null
        duration=3

union MediaInfo
    doc:
        Media specific information.

    photo PhotoInfo
    video VideoInfo
Tags that do not map to a type can be declared. The following example illustrates:
struct UpdateParentRev
    doc:
        On a write conflict, overwrite the existing file if the parent rev matches.

    parent_rev String:
        The revision to be updated.
    auto_rename Boolean:
        Whether the new file should be renamed on a conflict.

    example default
        parent_rev="abc123"
        auto_rename=false

union WriteConflictPolicy
    doc:
        Policy for managing write conflicts.

    reject:
        On a write conflict, reject the new file.
    overwrite:
        On a write conflict, overwrite the existing file.
    rename:
        On a write conflict, rename the new file with a numerical suffix.
    update_if_matching_parent_rev UpdateParentRev:
        On a write conflict, overwrite the existing file if the parent rev matches.
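One way a generated Python SDK could model such a union is sketched below. This is an assumption about a possible mapping, not the output of any existing BabelAPI generator; the `TaggedUnion` base class and its `tags` table are hypothetical names introduced for illustration.

```python
class TaggedUnion(object):
    """A value that is exactly one of a fixed set of tags (illustrative)."""

    # Maps tag name -> expected Python type, or None for tags without a type.
    tags = {}

    def __init__(self, tag, value=None):
        if tag not in self.tags:
            raise ValueError('unknown tag: %r' % tag)
        expected = self.tags[tag]
        if expected is None and value is not None:
            raise ValueError('tag %r does not carry a value' % tag)
        if expected is not None and not isinstance(value, expected):
            raise TypeError('tag %r requires a %s' % (tag, expected.__name__))
        self.tag = tag
        self.value = value


class UpdateParentRev(object):
    """Plain class standing in for the struct of the same name."""

    def __init__(self, parent_rev, auto_rename):
        self.parent_rev = parent_rev
        self.auto_rename = auto_rename


class WriteConflictPolicy(TaggedUnion):
    tags = {
        'reject': None,
        'overwrite': None,
        'rename': None,
        'update_if_matching_parent_rev': UpdateParentRev,
    }


policy = WriteConflictPolicy('rename')
update = WriteConflictPolicy(
    'update_if_matching_parent_rev', UpdateParentRev('abc123', False))
print(policy.tag, update.tag, update.value.parent_rev)
```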
These types exist without having to be declared:
- Boolean
- Integers: Int32, Int64, UInt32, UInt64
  - Attributes min_value and max_value can be set for more restrictive bounding.
- Float32, Float64
- String
  - Attributes min_length and max_length can be set.
- Timestamp
  - The format attribute must be set to define the format of the timestamp.
- List
  - The data_type must be set to define the type of elements.
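As a rough illustration of what these attributes mean for a validator, the sketch below checks integers and strings against their bounds. The function names are hypothetical, and the integer ranges are the usual ones implied by the type names rather than something stated in this document.

```python
# Inclusive ranges implied by the integer type names (an assumption).
INTEGER_RANGES = {
    'Int32': (-2**31, 2**31 - 1),
    'Int64': (-2**63, 2**63 - 1),
    'UInt32': (0, 2**32 - 1),
    'UInt64': (0, 2**64 - 1),
}


def check_integer(type_name, value, min_value=None, max_value=None):
    """Validate value against the type's range, narrowed by the attributes."""
    low, high = INTEGER_RANGES[type_name]
    if min_value is not None:
        low = max(low, min_value)
    if max_value is not None:
        high = min(high, max_value)
    if isinstance(value, bool) or not isinstance(value, int):
        raise TypeError('expected an integer')
    if not low <= value <= high:
        raise ValueError('%d is outside [%d, %d]' % (value, low, high))
    return value


def check_string(value, min_length=None, max_length=None):
    """Validate the length of a string against the optional attributes."""
    if not isinstance(value, str):
        raise TypeError('expected a string')
    if min_length is not None and len(value) < min_length:
        raise ValueError('string is shorter than %d' % min_length)
    if max_length is not None and len(value) > max_length:
        raise ValueError('string is longer than %d' % max_length)
    return value


print(check_integer('UInt64', 1000000))
print(check_string('xyz123', max_length=40))
```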
Sometimes we prefer to use an alias, rather than re-declaring a type over and over again. For example, the Dropbox API uses a special date format. We can create an alias called DbxTimestamp, which sets this format, and can be used in struct and union definitions:
alias DbxTimestamp = Timestamp(format="%a, %d %b %Y %H:%M:%S")
struct Example
    doc:
        An example.

    created DbxTimestamp:
        When this example was created.
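The format string in the alias uses the same directives as Python's strftime and strptime, so a Python SDK could plausibly handle a DbxTimestamp as sketched here. The two helper names are hypothetical. Note that %a and %b are locale-dependent in Python; the sketch assumes the default C locale.

```python
from datetime import datetime

# Format string copied from the DbxTimestamp alias above.
DBX_TIMESTAMP_FORMAT = '%a, %d %b %Y %H:%M:%S'


def parse_dbx_timestamp(text):
    """Convert the wire representation into a datetime."""
    return datetime.strptime(text, DBX_TIMESTAMP_FORMAT)


def format_dbx_timestamp(moment):
    """Convert a datetime back into the wire representation."""
    return moment.strftime(DBX_TIMESTAMP_FORMAT)


moment = parse_dbx_timestamp('Sat, 28 Jun 2014 18:23:21')
print(moment.isoformat())
print(format_dbx_timestamp(moment))
```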
Operations map to your API endpoints. You specify a list of data types for the request, and a list of data types for the response:
struct AccountInfoRequest
    doc:
        Input to request.

    account_id String = "me":
        A user's account identifier. Use "me" to get information for the
        current account.

op GetInfo
    doc:
        Get user account information.

    request
        AccountInfoRequest
    response
        AccountInfo
Note that account_id was given a default value of "me". This is useful
for including in generated SDKs.
The following is an example of an endpoint with two request segments:
struct FileUploadRequest
    doc:
        Stub.

    path String:
        The full path to the file you want to write to. It should not point
        to a folder.
    write_conflict_policy WriteConflictPolicy:
        Action to take if a file already exists at the specified path.

    example default
        path="Documents/plan.docx"

op Upload
    doc:
        Upload a file to Dropbox.

    request
        FileUploadRequest
        Binary
    response
        FileInfo
To help template writers tailor documentation to a language, we support stubs in documentation. Stubs are of the following format:
:tag:`value`
Supported tags are op, struct, field, and link.
- op
- A reference to an operation. Template writers should make a reference to the method that represents the operation.
- struct
- A reference to a struct. Template writers should make a reference to the class that represents the struct.
- field
- A reference to a field of a struct. It's intended for referencing parameters for functions, but its utility is still TBD.
- link
- A hyperlink. Template writers should convert this to the proper hyperlink format for the language.
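A template writer could rewrite stubs with a regular expression along these lines. This is a sketch, not code shipped with BabelAPI: the renderings chosen for each tag are arbitrary examples for a Python-flavored document, and because the syntax of a link value is not described here, link values are passed through unchanged.

```python
import re

# Matches :tag:`value` for the four supported tags.
STUB_PATTERN = re.compile(r':(op|struct|field|link):`([^`]+)`')

# Hypothetical per-tag renderings for a Python-oriented document.
RENDERERS = {
    'op': lambda value: value + '()',       # refer to the method
    'struct': lambda value: value,          # refer to the class by name
    'field': lambda value: '`' + value + '`',
    'link': lambda value: value,            # left as-is (format unknown)
}


def render_stubs(doc):
    """Replace every stub in doc with its language-specific rendering."""
    def replace(match):
        tag, value = match.group(1), match.group(2)
        return RENDERERS[tag](value)
    return STUB_PATTERN.sub(replace, doc)


print(render_stubs('Use :op:`GetInfo` to fetch an :struct:`AccountInfo`.'))
```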
A code generator is a Python class which will generate code for a target language given an API description. A code generator must satisfy the following conditions:
- The filename must have .babelt.py as its extension. For example, example.babelt.py
- A class must exist in the file that extends the babelapi.generator.generator.CodeGenerator class and implements the abstract generate() method. BabelAPI automatically detects subclasses and calls the generate() method.
Code generators have a self.api member variable. The object is an instance
of the babelapi.api.Api class. From this object, you can access all the
defined namespaces, data types, and operations. See the Python object definition
for more information.
The following examples can all be found in the babelapi/example/generator
folder.
We'll create a generator ex1.babelt.py that generates a file called
ex1.out. Each line in the file will be the name of a defined namespace:
from babelapi.generator.generator import CodeGenerator

class ExampleGenerator(CodeGenerator):

    def generate(self):
        """Generates a file that lists each namespace."""
        with self.output_to_relative_path('ex1.out'):
            for namespace in self.api.namespaces.values():
                self.emit_line(namespace.name)
We use output_to_relative_path(), a member of CodeGenerator, to specify
where the output of our emit*() calls goes (see the emit*() methods described below).
Run the generator from the root of the BabelAPI folder using the example specs we've provided:
$ babelapi example/api/dbx-core*.babel example/generator/ex1
Now examine the contents of the output:
$ cat example/generator/ex1/ex1.out
files
users
There are several emit*() methods that you can use from a CodeGenerator
that each serve a different purpose.
- emit(s)
  - The input string is written to the output file.
- emit_line(s, trailing_newline=True)
  - The current indentation level followed by the input string is written to the output file. If trailing_newline is True (default) then a newline is written as well.
- emit_wrapped_lines(s, prefix='', width=80, trailing_newline=True, first_line_prefix=True)
  - The current indentation level followed by the input prefix (assuming first_line_prefix is True) are written to the output file. The input string is then written into lines with each line starting with the indentation level and prefix. This is ideal for generating blocks of comments.
- emit_empty_line()
  - Writes an empty line to the output file.
- emit_indent()
  - Writes the number of tabs or spaces for the current indentation level to the output file.
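The wrapping behavior described for emit_wrapped_lines() can be approximated with the standard textwrap module. The sketch below is not the BabelAPI implementation; `wrap_lines` is a hypothetical stand-in that takes the indentation as an explicit argument, and it assumes that with first_line_prefix set to False the first line still receives the indentation but not the prefix.

```python
import textwrap


def wrap_lines(s, indent='', prefix='', width=80, first_line_prefix=True):
    """Wrap s so every line starts with the indentation and the prefix."""
    lead = indent + prefix
    # Assumption: without first_line_prefix, only the indentation is kept.
    first = lead if first_line_prefix else indent
    return textwrap.fill(
        s, width=width, initial_indent=first, subsequent_indent=lead)


text = "The user's used quota outside of shared folders (bytes)."
print(wrap_lines(text, indent='    ', prefix='# ', width=40))
```

With a prefix of '# ' this yields a block of Python comments at the current indentation, which is the use case the description above has in mind.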
Now we'll create a Python module for each namespace. Each module will define
a noop() function:
from babelapi.generator.generator import CodeGenerator

class ExamplePythonGenerator(CodeGenerator):

    def generate(self):
        """Generates a module for each namespace."""
        for namespace in self.api.namespaces.values():
            # One module per namespace is created. The module takes the name
            # of the namespace.
            with self.output_to_relative_path('{}.py'.format(namespace.name)):
                self._generate_namespace_module(namespace)

    def _generate_namespace_module(self, namespace):
        self.emit_line('def noop():')
        with self.indent():
            self.emit_line('pass')
Note how we used the self.indent() context manager to increase the
indentation level by a default 4 spaces. If you want to use tabs instead,
set the tabs_for_indents class variable of your extended CodeGenerator
class to True.
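To show what the indentation context manager does without installing BabelAPI, here is a small self-contained stand-in. `TinyEmitter` is not BabelAPI code: it only imitates emit_line(), indent(), and the tabs_for_indents switch as described above, and it assumes one tab per indentation level when tabs are enabled.

```python
from contextlib import contextmanager
import io


class TinyEmitter(object):
    """Stand-in imitating the emit/indent behavior described above."""

    tabs_for_indents = False

    def __init__(self):
        self.output = io.StringIO()
        self.level = 0  # number of nested indent() blocks

    @contextmanager
    def indent(self):
        # Increase the indentation for the duration of the with-block.
        self.level += 1
        try:
            yield
        finally:
            self.level -= 1

    def emit_line(self, s, trailing_newline=True):
        if self.tabs_for_indents:
            lead = '\t' * self.level
        else:
            lead = ' ' * (4 * self.level)
        self.output.write(lead + s + ('\n' if trailing_newline else ''))


class TabbedEmitter(TinyEmitter):
    tabs_for_indents = True


emitter = TinyEmitter()
emitter.emit_line('def noop():')
with emitter.indent():
    emitter.emit_line('pass')
print(emitter.output.getvalue())
```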
Run the generator from the root of the BabelAPI folder using the example specs we've provided:
$ babelapi example/api/dbx-core*.babel example/generator/ex2
Now examine the contents of the output:
$ cat example/generator/ex2/files.py
def noop():
    pass
$ cat example/generator/ex2/users.py
def noop():
    pass
As a more advanced example, we'll define a generator that makes a Python class
for each struct in our specification. We'll extend from
CodeGeneratorMonolingual, which enforces that a lang class variable is
declared:
from babelapi.data_type import Struct
from babelapi.generator.generator import CodeGeneratorMonolingual
from babelapi.lang.python import PythonTargetLanguage

class ExamplePythonGenerator(CodeGeneratorMonolingual):

    # PythonTargetLanguage has helper methods for formatting class, obj
    # and variable names (some languages use underscores to separate words,
    # others use camelcase).
    lang = PythonTargetLanguage()

    def generate(self):
        """Generates a module for each namespace."""
        for namespace in self.api.namespaces.values():
            # One module per namespace is created. The module takes the name
            # of the namespace.
            with self.output_to_relative_path('{}.py'.format(namespace.name)):
                self._generate_namespace_module(namespace)

    def _generate_namespace_module(self, namespace):
        for data_type in namespace.linearize_data_types():
            if not isinstance(data_type, Struct):
                # Do not handle Union types
                continue

            # Define a class for each struct
            class_def = 'class {}(object):'.format(self.lang.format_class(data_type.name))
            self.emit_line(class_def)

            with self.indent():
                if data_type.doc:
                    self.emit_line('"""')
                    self.emit_wrapped_lines(data_type.doc)
                    self.emit_line('"""')

                self.emit_empty_line()

                # Define constructor to take each field
                self.emit_line('def __init__', trailing_newline=False)
                args = ['self']
                for field in data_type.fields:
                    args.append(self.lang.format_variable(field.name))
                self._generate_func_arg_list(args)
                self.emit(':')
                self.emit_empty_line()

                with self.indent():
                    if data_type.fields:
                        # Body of init should assign all init vars
                        for field in data_type.fields:
                            if field.doc:
                                self.emit_wrapped_lines(field.doc, prefix='# ')
                            member_name = self.lang.format_variable(field.name)
                            self.emit_line('self.{0} = {0}'.format(member_name))
                    else:
                        self.emit_line('pass')
                self.emit_empty_line()
One new method of CodeGenerator that was used is _generate_func_arg_list(args).
It helps you generate a list of arguments in a function declaration or invocation,
enclosed by parentheses.
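In its simplest form, the job of that method amounts to the following. This is a simplified, hypothetical stand-in (`format_arg_list` is not a BabelAPI function); the real method writes to the output file and may handle long lists differently.

```python
def format_arg_list(args):
    """Join argument names into a parenthesized, comma-separated list."""
    return '(' + ', '.join(args) + ')'


# The constructor signature for a struct with two fields.
signature = 'def __init__' + format_arg_list(['self', 'name', 'size']) + ':'
print(signature)
```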
- Python
- Objective-C/iOS
- Java/Android
- Ruby
- PHP
- Web Docs
- Server Input Validation
- Server Output Validation
- Clients must accept new fields (ie. fields unknown to it), and ignore them.
- Server should be flexible on missing inputs (backwards compatibility) if a default value has been specified in the spec, but strict on what goes out.
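These two rules can be sketched as a tolerant decoder. The code below is an illustration only: `decode_fields` and the `REQUIRED` marker are hypothetical, and the field table simply mirrors the account_id field with its default of "me" from the operations example.

```python
# Marker for fields that have no default value in the spec.
REQUIRED = object()


def decode_fields(payload, fields):
    """Keep known fields, fill in defaults, and ignore unknown keys.

    fields maps each known field name to its default value, or to
    REQUIRED when the spec declares no default.
    """
    decoded = {}
    for name, default in fields.items():
        if name in payload:
            decoded[name] = payload[name]
        elif default is not REQUIRED:
            decoded[name] = default
        else:
            raise ValueError('missing required field: ' + name)
    # Keys in payload that are not in fields are deliberately dropped.
    return decoded


ACCOUNT_INFO_REQUEST = {'account_id': 'me'}
print(decode_fields({'locale': 'en'}, ACCOUNT_INFO_REQUEST))
```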
From the King James version of the Bible:
4 And they said, Go to, let us build us a city and a tower, whose top may reach unto heaven; and let us make us a name, lest we be scattered abroad upon the face of the whole earth.
5 And the Lord came down to see the city and the tower, which the children of men builded.
6 And the Lord said, Behold, the people is one, and they have all one language; and this they begin to do: and now nothing will be restrained from them, which they have imagined to do.
7 Go to, let us go down, and there confound their language, that they may not understand one another's speech.
8 So the Lord scattered them abroad from thence upon the face of all the earth: and they left off to build the city.
9 Therefore is the name of it called Babel; because the Lord did there confound the language of all the earth: and from thence did the Lord scatter them abroad upon the face of all the earth.
—Genesis 11:4–9[1]