jupyter

jupytext

kernelspec

language_info

plotly

notebook_metadata_filter

text_representation

all

extension	format_name	format_version	jupytext_version
.md	markdown	1.1	1.2.1

display_name	language	name
Python 3	python	python3

codemirror_mode

file_extension

mimetype

name

nbconvert_exporter

pygments_lexer

version

name	version
ipython	3

.py

text/x-python

python

ipython3

3.7.3

description	display_as	language	layout	name	order	page_type	permalink	redirect_from	thumbnail
How to make Histograms in Python with Plotly.	statistical	python	base	Histograms	3	example_index	python/histograms/	/python/histogram-tutorial/	thumbnail/histogram.jpg

Histogram with Plotly Express

In statistics, a histogram is representation of the distribution of numerical data, where the data are binned and the count for each bin is represented. More generally, in plotly a histogram is an aggregated bar chart, with several possible aggregation functions (e.g. sum, average, count...). Also, the data to be binned can be numerical data but also categorical or date data.

Plotly Express is the easy-to-use, high-level interface to Plotly, which operates on "tidy" data.

import plotly.express as px
tips = px.data.tips()
fig = px.histogram(tips, x="total_bill")
fig.show()

import plotly.express as px
tips = px.data.tips()
# Here we use a column with categorical data
fig = px.histogram(tips, x="day")
fig.show()

Choosing the number of bins

By default, the number of bins is chosen so that this number is comparable to the typical number of samples in a bin. This number can be customized, as well as the range of values.

import plotly.express as px
tips = px.data.tips()
fig = px.histogram(tips, x="total_bill", nbins=20)
fig.show()

Type of normalization

The default mode is to represent the count of samples in each bin. With the histnorm argument, it is also possible to represent the percentage or fraction of samples in each bin (histnorm='percent' or probability), or a density histogram (the sum of bars is equal to 100, density), or a probability density histogram (sum equal to 1, probability density).

import plotly.express as px
tips = px.data.tips()
fig = px.histogram(tips, x="total_bill", histnorm='probability density')
fig.show()

Aspect of the histogram plot

import plotly.express as px
tips = px.data.tips()
fig = px.histogram(tips, x="total_bill",
                   title='Histogram of bills',
                   labels={'total_bill':'total bill'}, # can specify one label per df column
                   opacity=0.8,
                   log_y=True, # represent bars with log scale
                   color_discrete_sequence=['indianred'] # color of histogram bars
                   )
fig.show()

Several histograms for the different values of one column

import plotly.express as px
tips = px.data.tips()
fig = px.histogram(tips, x="total_bill", color="sex")
fig.show()

Using histfunc

For each bin of x, one can compute a function of data using histfunc. The argument of histfunc is the dataframe column given as the y argument. Below the plot shows that the average tip increases with the total bill.

import plotly.express as px
tips = px.data.tips()
fig = px.histogram(tips, x="total_bill", y="tip", histfunc='avg')
fig.show()

Visualizing the distribution

With the marginal keyword, a subplot is drawn alongside the histogram, visualizing the distribution. See the distplot pagefor more examples of combined statistical representations.

import plotly.express as px
tips = px.data.tips()
fig = px.histogram(tips, x="total_bill", color="sex", marginal="rug", # can be `box`, `violin`
                         hover_data=tips.columns)
fig.show()

Histograms with go.Histogram

If Plotly Express does not provide a good starting point, it is also possible to use the more generic go.Histogram from plotly.graph_objects. All of the available histogram options are described in the histogram section of the reference page: https://plot.ly/python/reference#histogram.

Basic Histogram

import plotly.graph_objects as go

import numpy as np
np.random.seed(1)

x = np.random.randn(500)

fig = go.Figure(data=[go.Histogram(x=x)])
fig.show()

Normalized Histogram

import plotly.graph_objects as go

import numpy as np

x = np.random.randn(500)
fig = go.Figure(data=[go.Histogram(x=x, histnorm='probability')])

fig.show()

Horizontal Histogram

import plotly.graph_objects as go

import numpy as np

y = np.random.randn(500)
# Use `y` argument instead of `x` for horizontal histogram

fig = go.Figure(data=[go.Histogram(y=y)])
fig.show()

Overlaid Histogram

import plotly.graph_objects as go

import numpy as np

x0 = np.random.randn(500)
# Add 1 to shift the mean of the Gaussian distribution
x1 = np.random.randn(500) + 1

fig = go.Figure()
fig.add_trace(go.Histogram(x=x0))
fig.add_trace(go.Histogram(x=x1))

# Overlay both histograms
fig.update_layout(barmode='overlay')
# Reduce opacity to see both histograms
fig.update_traces(opacity=0.75)
fig.show()

Stacked Histograms

import plotly.graph_objects as go

import numpy as np

x0 = np.random.randn(2000)
x1 = np.random.randn(2000) + 1

fig = go.Figure()
fig.add_trace(go.Histogram(x=x0))
fig.add_trace(go.Histogram(x=x1))

# The two histograms are drawn on top of another
fig.update_layout(barmode='stack')
fig.show()

Styled Histogram

import plotly.graph_objects as go

import numpy as np
x0 = np.random.randn(500)
x1 = np.random.randn(500) + 1

fig = go.Figure()
fig.add_trace(go.Histogram(
    x=x0,
    histnorm='percent',
    name='control', # name used in legend and hover labels
    xbins=dict( # bins used for histogram
        start=-4.0,
        end=3.0,
        size=0.5
    ),
    marker_color='#EB89B5',
    opacity=0.75
))
fig.add_trace(go.Histogram(
    x=x1,
    histnorm='percent',
    name='experimental',
    xbins=dict(
        start=-3.0,
        end=4,
        size=0.5
    ),
    marker_color='#330C73',
    opacity=0.75
))

fig.update_layout(
    title_text='Sampled Results', # title of plot
    xaxis_title_text='Value', # xaxis label
    yaxis_title_text='Count', # yaxis label
    bargap=0.2, # gap between bars of adjacent location coordinates
    bargroupgap=0.1 # gap between bars of the same location coordinates
)

fig.show()

Cumulative Histogram

import plotly.graph_objects as go

import numpy as np

x = np.random.randn(500)
fig = go.Figure(data=[go.Histogram(x=x, cumulative_enabled=True)])

fig.show()

Specify Aggregation Function

import plotly.graph_objects as go

x = ["Apples","Apples","Apples","Oranges", "Bananas"]
y = ["5","10","3","10","5"]

fig = go.Figure()
fig.add_trace(go.Histogram(histfunc="count", y=y, x=x, name="count"))
fig.add_trace(go.Histogram(histfunc="sum", y=y, x=x, name="sum"))

fig.show()

Custom Binning

For custom binning along x-axis, use the attribute nbinsx. Please note that the autobin algorithm will choose a 'nice' round bin size that may result in somewhat fewer than nbinsx total bins. Alternatively, you can set the exact values for xbins along with autobinx = False.

import plotly.graph_objects as go
from plotly.subplots import make_subplots

x = ['1970-01-01', '1970-01-01', '1970-02-01', '1970-04-01', '1970-01-02',
     '1972-01-31', '1970-02-13', '1971-04-19']

fig = make_subplots(rows=3, cols=2)

trace0 = go.Histogram(x=x, nbinsx=4)
trace1 = go.Histogram(x=x, nbinsx=8)
trace2 = go.Histogram(x=x, nbinsx=10)
trace3 = go.Histogram(x=x,
                      xbins=dict(
                      start='1969-11-15',
                      end='1972-03-31',
                      size='M18')) # M18 stands for 18 months
                      
trace4 = go.Histogram(x=x,
                      xbins=dict(
                      start='1969-11-15',
                      end='1972-03-31',
                      size='M4')) # M4 months bin size
                      
trace5 = go.Histogram(x=x,
                      xbins=dict(
                      start='1969-11-15',
                      end='1972-03-31',
                      size='M2')) # M2 months

fig.append_trace(trace0, 1, 1)
fig.append_trace(trace1, 1, 2)
fig.append_trace(trace2, 2, 1)
fig.append_trace(trace3, 2, 2)
fig.append_trace(trace4, 3, 1)
fig.append_trace(trace5, 3, 2)

fig.show()

Share bins between histograms

In this example both histograms have a compatible bin settings using bingroup attribute. Note that traces on the same subplot, and with the same barmode ("stack", "relative", "group") are forced into the same bingroup, however traces with barmode = "overlay" and on different axes (of the same axis type) can have compatible bin settings. Histogram and histogram2d trace can share the same bingroup.

import plotly.graph_objects as go
import numpy as np

fig = go.Figure(go.Histogram(
    x=np.random.randint(7, size=100),
    bingroup=1))

fig.add_trace(go.Histogram(
    x=np.random.randint(7, size=20),
    bingroup=1))

fig.update_layout(
    barmode="overlay",
    bargap=0.1)

fig.show()

Dash Example

Dash is an Open Source Python library which can help you convert plotly figures into a reactive, web-based application. Below is a simple example of a dashboard created using Dash. Its source code can easily be deployed to a PaaS.

from IPython.display import IFrame
IFrame(src= "https://dash-simple-apps.plotly.host/dash-histogramplot/", width="100%", height="650px", frameBorder="0")

from IPython.display import IFrame
IFrame(src= "https://dash-simple-apps.plotly.host/dash-histogramplot/code", width="100%", height=500, frameBorder="0")

Reference

See https://plot.ly/python/reference/#histogram for more information and chart attribute options!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

histograms.md

histograms.md

Histogram with Plotly Express

Choosing the number of bins

Type of normalization

Aspect of the histogram plot

Several histograms for the different values of one column

Using histfunc

Visualizing the distribution

Histograms with go.Histogram

Basic Histogram

Normalized Histogram

Horizontal Histogram

Overlaid Histogram

Stacked Histograms

Styled Histogram

Cumulative Histogram

Specify Aggregation Function

Custom Binning

Share bins between histograms

Dash Example

Reference

Files

histograms.md

Latest commit

History

histograms.md

File metadata and controls

Histogram with Plotly Express

Choosing the number of bins

Type of normalization

Aspect of the histogram plot

Several histograms for the different values of one column

Using histfunc

Visualizing the distribution

Histograms with go.Histogram

Basic Histogram

Normalized Histogram

Horizontal Histogram

Overlaid Histogram

Stacked Histograms

Styled Histogram

Cumulative Histogram

Specify Aggregation Function

Custom Binning

Share bins between histograms

Dash Example

Reference