infi.clickhouse_orm/README.rst

166 lines
4.2 KiB
ReStructuredText
Raw Normal View History

2016-06-26 15:11:23 +03:00
Overview
========
This project is simple ORM for working with the `ClickHouse database <https://clickhouse.yandex/>`_.
It allows you to define model classes whose instances can be written to the database and read from it.
Installation
============
To install infi.clickhouse_orm::
pip install infi.clickhouse_orm
Usage
=====
Defining Models
---------------
Models are defined in a way reminiscent of Django's ORM:
.. code:: python
from infi.clickhouse_orm import models, fields, engines
class Person(models.Model):
first_name = fields.StringField()
last_name = fields.StringField()
birthday = fields.DateField()
height = fields.Float32Field()
engine = engines.MergeTree('birthday', ('first_name', 'last_name', 'birthday'))
2016-06-26 16:52:25 +03:00
It is possible to provide a default value for a field, instead of its "natural" default (empty string for string fields, zero for numeric fields etc.).
2016-06-26 15:11:23 +03:00
2016-06-26 16:52:25 +03:00
See below for the supported field types and table engines.
2016-06-26 15:11:23 +03:00
Using Models
------------
Once you have a model, you can create model instances:
.. code:: python
>>> dan = Person(first_name='Dan', last_name='Schwartz')
>>> suzy = Person(first_name='Suzy', last_name='Jones')
>>> dan.first_name
u'Dan'
2016-06-26 16:52:25 +03:00
When values are assigned to model fields, they are immediately converted to their Pythonic data type.
In case the value is invalid, a ``ValueError`` is raised:
2016-06-26 15:11:23 +03:00
.. code:: python
>>> suzy.birthday = '1980-01-17'
>>> suzy.birthday
datetime.date(1980, 1, 17)
>>> suzy.birthday = 0.5
ValueError: Invalid value for DateField - 0.5
>>> suzy.birthday = '1922-05-31'
ValueError: DateField out of range - 1922-05-31 is not between 1970-01-01 and 2038-01-19
2016-06-26 16:52:25 +03:00
Inserting to the Database
-------------------------
To write your instances to ClickHouse, you need a ``Database`` instance:
2016-06-26 15:11:23 +03:00
.. code:: python
from infi.clickhouse_orm.database import Database
db = Database('my_test_db')
This automatically connects to http://localhost:8123 and creates a database called my_test_db, unless it already exists.
If necessary, you can specify a different database URL and optional credentials:
.. code:: python
db = Database('my_test_db', db_url='http://192.168.1.1:8050', username='scott', password='tiger')
2016-06-26 16:52:25 +03:00
Using the ``Database`` instance you can create a table for your model, and insert instances to it:
2016-06-26 15:11:23 +03:00
.. code:: python
db.create_table(Person)
db.insert([dan, suzy])
2016-06-26 16:52:25 +03:00
The ``insert`` method can take any iterable of model instances, but they all must belong to the same model class.
Reading from the Database
-------------------------
Loading model instances from the database is simple:
.. code:: python
for person in db.select("SELECT * FROM my_test_db.person", model_class=Person):
print person.first_name, person.last_name
Do not include a ``FORMAT`` clause in the query, since the ORM automatically sets the format to ``TabSeparatedWithNamesAndTypes``.
It is possible to select only a subset of the columns, and the rest will receive their default values:
.. code:: python
2016-06-26 15:11:23 +03:00
2016-06-26 16:52:25 +03:00
for person in db.select("SELECT first_name FROM my_test_db.person WHERE last_name='Smith'", model_class=Person):
print person.first_name
2016-06-26 15:11:23 +03:00
2016-06-26 16:52:25 +03:00
Specifying a model class is not required. In case you do not provide a model class, an ad-hoc class will
be defined based on the column names and types returned by the query:
.. code:: python
for row in db.select("SELECT max(height) as max_height FROM my_test_db.person"):
print row.max_height
Counting
--------
The ``Database`` class also supports counting records easily:
.. code:: python
>>> db.count(Person)
117
>>> db.count(Person, conditions="height > 1.90")
6
2016-06-26 15:11:23 +03:00
Field Types
-----------
Currently the following field types are supported:
- UInt8Field
- UInt16Field
- UInt32Field
- UInt64Field
- Int8Field
- Int16Field
- Int32Field
- Int64Field
- Float32Field
- Float64Field
- StringField
- DateField
- DateTimeField
2016-06-26 16:52:25 +03:00
Table Engines
-------------
TBD
2016-06-26 15:11:23 +03:00
Development
===========
After cloning the project, run the following commands::
easy_install -U infi.projector
cd infi.clickhouse_orm
projector devenv build
To run the tests, ensure that the ClickHouse server is running on http://localhost:8123/ (this is the default), and run::
2016-06-26 16:52:25 +03:00
bin/nosetests