Documentation
Data modeling
Polymorphic cubes

Polymorphic cubes

In programming languages, polymorphism usually means the use of a single symbol to represent multiple different types. It can be quite common for a database and application to be designed in such a way that leverages a single database table for entities of different types that share common traits.

For example, you are working on an online education platform, where teachers assign lessons to students. The database can contain only two tables: one for users and another one for lessons. The users table can contain a type column, with possible values teacher or student. Here is how it could look:

idtypenameschool
1studentCarl AndersonBalboa High School
2studentLuke SkywalkerBalboa High School
31teacherJohn DoeBalboa High School

Lessons are assigned by teachers and completed by students. The lessons table has both teacher_id and student_id, which are actually references to the user id. The lessons table can look like this:

idteacher_idstudent_idname
100311Multiplication and the meaning of the Factors
101312Division as an Unknown Factor Problem

The best way to design such a data model is by using what we call Polymorphic Cubes. It relies on the extends feature and prevents you from duplicating code, while preserving the correct domain logic. Learn more about using extends here.

The first step is to create a user cube, which will act as a base cube for our teachers and students cubes and will contain all common measures and dimensions:

YAML
JavaScript
cubes:  - name: users  sql: SELECT * FROM users    measures:  - name: count  type: count    dimensions:  - name: name  sql: name  type: string    - name: school  sql: school  type: string

Then you can derive the teachers and students cubes from users:

YAML
JavaScript
cubes:  - name: teachers  extends: users  sql: |  SELECT *  FROM {users.sql()}  WHERE type = 'teacher'    - name: students  extends: users  sql: |  SELECT *  FROM {users.sql()}  WHERE type = 'student'

Once we have those cubes, we can define correct joins from the lessons cube:

YAML
JavaScript
cubes:  - name: lessons  sql_table: lessons    joins:  - name: students  relationship: many_to_one  sql: "{CUBE}.student_id = {students.id}"    - name: teachers  relationship: many_to_one  sql: "{CUBE}.teacher_id = {teachers.id}"