PostHog: Open-source product analytics and feature management
Event tracking, analytics, and experimentation platform.
Learn more about PostHog
PostHog is an open-source analytics and product management platform written in Python and JavaScript/TypeScript that can be deployed as a cloud service or self-hosted instance. The system captures user events through autocapture or manual instrumentation, stores them in a queryable data warehouse, and provides multiple interfaces for analysis including visualization tools and SQL access. It includes modules for session replay recording, feature flag management with cohort targeting, statistical experiment frameworks, and data pipeline processing with integrations to external services. Typical deployments range from small self-hosted hobby instances to large-scale cloud deployments handling millions of events monthly.
Unified feature set
Combines analytics, session replay, feature flags, experiments, and data pipelines in a single platform rather than requiring separate tools. This integration allows cross-referencing data across different product intelligence domains.
Self-hosting option
Offers a deployable open-source version that can run on infrastructure controlled by the user, with a documented hobby deployment process for single-machine setups and support for larger distributed deployments.
SQL-based querying
Provides direct SQL access to event data alongside visual analytics interfaces, allowing users to write custom queries against the underlying data warehouse for ad-hoc analysis.
import posthog
posthog.project_api_key = 'your_api_key'
posthog.host = 'https://app.posthog.com'
posthog.capture(
distinct_id='user_123',
event='button_clicked',
properties={'button_name': 'signup', 'page': 'homepage'}
)See how people are using PostHog
Top in Data Engineering
Related Repositories
Discover similar tools and frameworks used by developers
dbt
SQL-based transformation framework for analytics data warehouses.
Fiona
Python library for reading and writing geographic data files like GeoPackage and Shapefile.
Neo4j
Open-source graph database storing data as nodes and relationships with Cypher query language.
COVID-19 Data
Archived NYT dataset of coronavirus cases and deaths across U.S. counties and states (2020-2023).
pdfplumber
Python library for extracting PDF text and tables.