Open source visualization tools

From Redash to Metabase, here are a few open source data visualization tools.

Posted by Allan Situma on March 30, 2020 · 8 mins read
There is a magic in graphs. The profile of a curve reveals in a flash a whole situation — the life history of an epidemic, a panic, or an era of prosperity. The curve informs the mind, awakens the imagination, convinces. - Henry D. Hubbard

Business intelligence

The purpose of BI is to support and facilitate decision making across the entire organization. The BI team does this by combining data from multiple sources, then aggregating, analyzing, visualizing, and sharing that information with the relevant stakeholders.

This article focuses on the visualization function of business intelligence. We will look at several open source solutions that are simple to set up, easy to use, and produce good reports.

Business Intelligence (BI) Tools for Data Visualization

Business Intelligence (BI) tools are essential for visualizing data to support decision-making across organizations. When it comes to open source BI solutions, simplicity of setup, ease of use, and the ability to produce high-quality reports are key considerations. Here are three popular open source BI tools that excel in these areas:

1. Redash

Description: Redash is an open source tool designed for data visualization and dashboard creation. It helps users query their data sources and create visual insights easily.

Key Features:

  • Data Sources: Connects to a wide variety of data sources including SQL databases, NoSQL databases, APIs, and cloud services.
  • Query Editor: Provides a robust SQL editor with support for multiple query tabs, snippets, and auto-complete.
  • Visualization: Offers a range of visualization types, including charts, graphs, and maps.
  • Dashboards: Allows users to combine multiple visualizations into interactive dashboards.
  • Collaboration: Facilitates sharing of dashboards and queries among team members with control over access permissions.

Advantages:

  • Easy to set up and use.
  • Strong community support.
  • Flexible integration with various data sources.

Use Cases:

  • Ad-hoc querying and visualization.
  • Building interactive dashboards for different stakeholders.
  • Monitoring and alerting based on data changes.

Installation using Docker:

  1. Clone the Redash repository:
     git clone https://github.com/getredash/redash.git
     cd redash
  2. Run the Docker setup:
     docker-compose up -d
  3. Access Redash:
     Open your browser and go to http://localhost:5000. Follow the setup instructions to create an admin account and start using Redash.
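
Once Redash is running and you have saved a query, its cached results can also be read programmatically. The following is a minimal sketch, assuming the instance is reachable at http://localhost:5000 as above and that you pass in a real query id and API key (the values in the usage note are placeholders, and the helper names are made up for this example). It relies on Redash's `/api/queries/<id>/results.json` endpoint and uses only the Python standard library.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Assumption: the local Docker setup from this article.
REDASH_URL = "http://localhost:5000"


def results_url(base_url: str, query_id: int, api_key: str) -> str:
    """Build the URL for the cached JSON results of a saved query."""
    query_string = urlencode({"api_key": api_key})
    return f"{base_url.rstrip('/')}/api/queries/{query_id}/results.json?{query_string}"


def extract_rows(payload: dict) -> list:
    """Pull the list of row dictionaries out of a results payload."""
    return payload["query_result"]["data"]["rows"]


def fetch_rows(base_url: str, query_id: int, api_key: str) -> list:
    """Download the latest cached results of a saved query."""
    with urlopen(results_url(base_url, query_id, api_key), timeout=10) as resp:
        return extract_rows(json.load(resp))
```

Calling `fetch_rows(REDASH_URL, 1, "YOUR_API_KEY")` would return one dictionary per result row, keyed by column name. The query must have been executed at least once so that a cached result exists.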

2. Apache Superset

Description: Apache Superset is a modern, enterprise-ready business intelligence web application that allows users to explore and visualize their data through a rich interface.

Key Features:

  • Data Exploration: Users can explore data interactively, slice and dice it, and drill down into details.
  • Visualizations: A wide array of visualizations are supported, including time-series charts, pie charts, histograms, and more.
  • SQL Editor: Built-in SQL IDE for advanced users to create complex queries.
  • Dashboards: Create and share interactive dashboards with real-time data updates.
  • Security: Granular security and authentication features for enterprise use.

Advantages:

  • Scalable and suitable for large datasets.
  • Rich visualization options.
  • Strong integration with SQL databases.

Use Cases:

  • Comprehensive data exploration and analysis.
  • Developing complex dashboards for executive reporting.
  • Integrating BI capabilities within existing data workflows.

Installation using Docker:

  1. Clone the Superset repository:
     git clone https://github.com/apache/superset.git
     cd superset
  2. Run the Docker setup:
     docker-compose -f docker-compose-non-dev.yml pull
     docker-compose -f docker-compose-non-dev.yml up
  3. Initialize the database and create an admin user:
     docker exec -it superset_app bash
     superset db upgrade
     export FLASK_APP=superset
     superset fab create-admin --username admin --firstname Admin --lastname User --email admin@superset.com --password admin
     superset init
     exit
  4. Access Superset:
     Open your browser and go to http://localhost:8088. Log in with the admin credentials created above.
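
The same admin credentials can be used against Superset's REST API, which is handy for checking that the installation works end to end. Below is a rough sketch, assuming the instance runs at http://localhost:8088 and uses the default database authentication (`"provider": "db"`). The function names are illustrative, not part of Superset; the endpoints used are `/api/v1/security/login` and `/api/v1/dashboard/`.

```python
import json
from urllib.request import Request, urlopen

# Assumption: the local Docker setup from this article.
SUPERSET_URL = "http://localhost:8088"


def login_request(base_url: str, username: str, password: str) -> Request:
    """Build the POST request that exchanges credentials for a JWT."""
    body = json.dumps(
        {"username": username, "password": password, "provider": "db", "refresh": True}
    ).encode("utf-8")
    return Request(
        f"{base_url.rstrip('/')}/api/v1/security/login",
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def dashboards_request(base_url: str, access_token: str) -> Request:
    """Build the authenticated GET request that lists dashboards."""
    return Request(
        f"{base_url.rstrip('/')}/api/v1/dashboard/",
        headers={"Authorization": f"Bearer {access_token}"},
    )


def list_dashboard_titles(base_url: str, username: str, password: str) -> list:
    """Log in, then return the titles of the dashboards the user can see."""
    with urlopen(login_request(base_url, username, password), timeout=10) as resp:
        token = json.load(resp)["access_token"]
    with urlopen(dashboards_request(base_url, token), timeout=10) as resp:
        return [item["dashboard_title"] for item in json.load(resp)["result"]]
```

With the admin user created in step 3, `list_dashboard_titles(SUPERSET_URL, "admin", "admin")` should return a list of titles, which is empty on a fresh install without example data.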

3. Metabase

Description: Metabase is an open source BI tool that makes analytics accessible to everyone in an organization, not just data scientists and analysts.

Key Features:

  • Ease of Use: Simple and intuitive interface with no need for SQL knowledge for basic operations.
  • Query Builder: A point-and-click interface for creating queries without writing SQL, though SQL mode is available for advanced users.
  • Visualizations: Provides a variety of visualization options, including bar charts, line graphs, and maps.
  • Dashboards: Users can create and share dashboards that automatically update as new data comes in.
  • Sharing and Embedding: Easy sharing of results and embedding of charts in other web applications.

Advantages:

  • Extremely user-friendly, suitable for non-technical users.
  • Quick setup with a minimal learning curve.
  • Active community and regular updates.

Use Cases:

  • Democratizing data access within organizations.
  • Rapid dashboard creation for various business needs.
  • Embedding analytics in web applications and portals.
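
To make the embedding use case concrete: Metabase's signed embedding works by having your application sign a short-lived JSON Web Token with the embedding secret key and place it in an iframe URL. Here is a minimal sketch using only the Python standard library. It assumes embedding has been enabled in the Metabase admin settings; the site URL, secret, and dashboard id in the usage note are placeholders, and the helper names are made up for this example. In a real application a maintained JWT library would be a more sensible choice than hand-rolled signing.

```python
import base64
import hashlib
import hmac
import json
import time


def _b64url(data: bytes) -> str:
    """Base64url-encode without padding, as JWTs require."""
    return base64.urlsafe_b64encode(data).rstrip(b"=").decode("ascii")


def sign_hs256(payload: dict, secret: str) -> str:
    """Create a compact JWT signed with HMAC-SHA256."""
    header = {"alg": "HS256", "typ": "JWT"}
    signing_input = ".".join(
        _b64url(json.dumps(part, separators=(",", ":")).encode("utf-8"))
        for part in (header, payload)
    )
    signature = hmac.new(
        secret.encode("utf-8"), signing_input.encode("ascii"), hashlib.sha256
    ).digest()
    return f"{signing_input}.{_b64url(signature)}"


def dashboard_embed_url(site_url, secret, dashboard_id, params=None,
                        ttl_seconds=600, now=None):
    """Return an iframe URL for a dashboard that expires after ttl_seconds."""
    issued_at = int(time.time()) if now is None else now
    payload = {
        "resource": {"dashboard": dashboard_id},
        "params": params or {},
        "exp": issued_at + ttl_seconds,
    }
    token = sign_hs256(payload, secret)
    return f"{site_url.rstrip('/')}/embed/dashboard/{token}#bordered=true&titled=true"
```

For example, `dashboard_embed_url("http://localhost:3000", "YOUR_EMBEDDING_SECRET", 1)` produces a URL that can be used as the `src` of an iframe in another web application. The secret must stay on the server side; only the signed URL is sent to the browser.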

Installation using Docker:

  1. Pull the Metabase Docker image:
     docker pull metabase/metabase
  2. Run the Docker container:
     docker run -d -p 3000:3000 --name metabase metabase/metabase
  3. Access Metabase:
     Open your browser and go to http://localhost:3000. Follow the setup instructions to create an admin account and start using Metabase.

Conclusion

For organizations looking to leverage open source tools for data visualization as part of their BI strategy, Redash, Apache Superset, and Metabase offer powerful yet user-friendly solutions. Each tool has unique strengths:

  • Redash is great for teams that need flexibility in querying and strong integration with various data sources.
  • Apache Superset is ideal for large-scale, enterprise-level BI needs with a focus on data exploration and rich visualizations.
  • Metabase is perfect for making data accessible across the entire organization, providing a simple interface for non-technical users.

Comparison and Choosing the Right Tool

When selecting the right open source BI tool for your organization, consider the following aspects:

1. Ease of Use:

Metabase stands out for its user-friendly interface and is suitable for non-technical users who need quick insights without deep technical knowledge.

2. Scalability and Enterprise Features:

Apache Superset is designed for scalability and offers robust security features, making it ideal for large enterprises with complex BI needs.

3. Flexibility:

Redash offers flexibility in querying and supports a wide range of data sources, making it a good choice for teams that need versatile data integration.

Additional Considerations

Community and Support:

All three tools have active communities, but the level of support and the frequency of updates vary. Metabase and Redash are known for their active community engagement, while Apache Superset benefits from the backing of the Apache Software Foundation.

Customization and Extensibility:

If you need to customize or extend the functionality, Apache Superset offers more advanced options for developers, while Redash and Metabase provide sufficient customization for most standard use cases.

Deployment Flexibility:

Docker installations are straightforward for all three tools, but each tool also supports other deployment options, such as cloud-based setups, which can be explored based on your organization’s infrastructure.
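
Whichever deployment route you take, the containers usually need a little time before the web interface answers. The sketch below polls a URL until it responds. The addresses in `LOCAL_URLS` are the default local ones used in the Docker instructions in this article; the function names and retry settings are arbitrary choices for illustration.

```python
import time
from urllib.error import HTTPError, URLError
from urllib.request import urlopen

# Default local addresses from the Docker instructions in this article.
LOCAL_URLS = {
    "Redash": "http://localhost:5000",
    "Apache Superset": "http://localhost:8088",
    "Metabase": "http://localhost:3000",
}


def is_responding(url: str, timeout: float = 3.0) -> bool:
    """Return True if an HTTP server answers at url, whatever the status code."""
    try:
        with urlopen(url, timeout=timeout):
            return True
    except HTTPError:
        # The server replied with an error status, so it is at least running.
        return True
    except (URLError, OSError):
        # Connection refused, DNS failure, timeout, or a dropped connection.
        return False


def wait_until_up(url: str, attempts: int = 30, delay: float = 2.0) -> bool:
    """Poll url up to `attempts` times, sleeping `delay` seconds between tries."""
    for attempt in range(attempts):
        if is_responding(url):
            return True
        if attempt < attempts - 1:
            time.sleep(delay)
    return False
```

For instance, `wait_until_up(LOCAL_URLS["Metabase"])` returns `True` as soon as the Metabase container starts serving requests, and `False` if nothing answers within roughly a minute.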

By carefully considering your organization’s specific needs, technical expertise, and existing infrastructure, you can choose the most suitable open source BI tool to empower data-driven decision-making and create high-quality, insightful reports.
