| Model | Date of original paper | Energy consumption (kWh) | Carbon footprint (lbs of CO2e) | Cloud compute cost (USD) |
|---|---|---|---|---|
| Transformer (65M parameters) | Jun 2017 | 27 | 26 | $41-$140 |
| Transformer (213M parameters) | Jun 2017 | 201 | 192 | $289-$981 |
| ELMo | Feb 2018 | 275 | 262 | $433-$1,472 |
| BERT (110M parameters) | Oct 2018 | 1,507 | 1,438 | $3,751-$12,571 |
| Transformer (213M parameters) w/ neural architecture search | Jan 2019 | 656,347 | 626,155 | $942,973-$3,201,722 |
| GPT-2 | Feb 2019 | - | - | $12,902-$43,008 |