Download in Bulk

US ETF Constituents

Introduction

Download the US ETF Constituents dataset in bulk to get the full dataset without any ETF selection bias. The bulk dataset package contains constituents data for all of the supported ETFs for every trading day.

The US ETF Constituents dataset depends on the US Equity Security Master dataset because the US Equity Security Master contains information on splits, dividends, and symbol changes.

Download History

To unlock local access to the US ETF Constituents dataset, open the Pricing page of your organization and subscribe to the US ETF Constituents History by QuantConnect data package. If you don't already subscribe to the US Equity Security Master by QuantConnect data package, subscribe to it too. You need billing permissions to change the organization's subscriptions.

After you subscribe to local access, download the US ETF Constituents data by opening a terminal in your organization workspace and running:

$ lean data download --dataset "US ETF Constituents" --data-type "Download in Bulk"

To download the US Equity Security Master, run:

$ lean data download --dataset "US Equity Security Master"

Download Daily Updates

After you bulk download the US ETF Constituents dataset, new daily updates are available at 7 AM Eastern Time (ET) after each trading day. To unlock local access to the data updates, open the Pricing page of your organization and subscribe to the US ETF Constituents Updates by QuantConnect data package. You need billing permissions to change the organization's subscriptions.

After you subscribe to dataset updates, to update your local copy of the US ETF Constituents dataset, open a terminal in your organization workspace and run:

$ lean data download --dataset "US ETF Constituents" --data-type "Download in Bulk"

To update your local copy of the US Equity Security Master, run:

$ lean data download --dataset "US Equity Security Master"

Alternatively, instead of directly calling the lean data download command, you can place a Python script in the data directory of your organization workspace and run it to update your data files. The following example script updates all of the new data that's missing from your local copy:

import os
from datetime import datetime
from pytz import timezone

# Today's date in Eastern Time, formatted like the data file names (YYYYMMDD).
END_DATE = datetime.now(timezone("US/Eastern")).strftime("%Y%m%d")
# The newest SPY file name, without its extension, is the latest local date.
latest_date = sorted(os.listdir("equity/usa/universes/etf/spy"))[-1].split(".")[0]
if latest_date >= END_DATE:
    print("Your data is already up to date.")
else:
    print("Updating data...")
    os.system('lean data download --dataset "US ETF Constituents" --data-type "Download in Bulk"')

The preceding script checks the date of the most recent SPY data you have. If there is new data available for SPY, it downloads the new data files for all of the ETFs. You may need to adjust this script to fit your needs.
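One possible adjustment, offered as a minimal sketch rather than an official script, is to split the date check into small functions and to run the US Equity Security Master download alongside the US ETF Constituents download. The function names (latest_local_date, needs_update, update_if_stale) and the UPDATE_COMMANDS list are hypothetical; the two commands are the ones shown earlier on this page. The sketch assumes the file names in the SPY directory start with a YYYYMMDD date, as the preceding script implies.

```python
import os
import subprocess
from datetime import date

# The two download commands from this page, written as argument lists.
UPDATE_COMMANDS = [
    ["lean", "data", "download", "--dataset", "US ETF Constituents",
     "--data-type", "Download in Bulk"],
    ["lean", "data", "download", "--dataset", "US Equity Security Master"],
]

def latest_local_date(directory):
    """Return the newest YYYYMMDD stamp among the file names, or None."""
    stamps = [name.split(".")[0] for name in os.listdir(directory)]
    # Assumption: data files are named by date; ignore anything else.
    stamps = [s for s in stamps if len(s) == 8 and s.isdigit()]
    return max(stamps) if stamps else None

def needs_update(latest, today):
    """True when there is no local data or it is older than `today`."""
    return latest is None or latest < today.strftime("%Y%m%d")

def update_if_stale(directory, today):
    """Run both downloads when the local data is stale; report whether it ran."""
    if not needs_update(latest_local_date(directory), today):
        return False
    for command in UPDATE_COMMANDS:
        # check=True stops at the first failed download.
        subprocess.run(command, check=True)
    return True

# Example call from the data directory, using Eastern Time for "today":
# from datetime import datetime
# from zoneinfo import ZoneInfo
# update_if_stale("equity/usa/universes/etf/spy",
#                 datetime.now(ZoneInfo("America/New_York")).date())
```

Filtering on eight-digit names keeps a stray file in the directory from being mistaken for the latest date, and returning None for an empty directory triggers a download instead of raising an IndexError.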

Size and Format

The US ETF Constituents dataset is 50 GB in size. We structure the data files so there is one file per ETF per day.
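Because there is one file per ETF per day, a quick way to inspect a local copy is to count the files in each ETF directory. The following is a rough sketch, not part of the LEAN CLI. It assumes each ETF has its own directory under equity/usa/universes/etf, which is inferred from the SPY path in the update script above, and the function name count_files_per_etf is hypothetical.

```python
import os

def count_files_per_etf(root):
    """Map each ETF directory name under `root` to its number of data files."""
    counts = {}
    for entry in sorted(os.listdir(root)):
        path = os.path.join(root, entry)
        if not os.path.isdir(path):
            continue  # Skip anything that is not an ETF directory.
        counts[entry] = sum(
            1 for name in os.listdir(path)
            if os.path.isfile(os.path.join(path, name))
        )
    return counts

# Example call from the data directory:
# for ticker, days in count_files_per_etf("equity/usa/universes/etf").items():
#     print(ticker, days)
```

An ETF with far fewer files than the others may indicate a partial download, or it may simply have a shorter history.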

Price

The following table shows the price of an annual subscription to the US Equity Security Master for each organization tier:

Tier              Price ($/Year)
Quant Researcher  600
Team              900
Trading Firm      1,200
Institution       1,800

All of the historical US ETF Constituents data costs $3,960. An annual subscription to daily updates costs $1,200/year.
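To estimate a first-year total, you can add the figures on this page together. The sketch below is illustrative arithmetic only. It assumes the $3,960 historical package is a one-time charge and that you also pay for one year of the US Equity Security Master at your tier; confirm actual pricing on the Pricing page of your organization. The function name first_year_cost is hypothetical.

```python
# Prices in US dollars, copied from this page.
SECURITY_MASTER_PER_YEAR = {
    "Quant Researcher": 600,
    "Team": 900,
    "Trading Firm": 1200,
    "Institution": 1800,
}
HISTORY_ONE_TIME = 3960   # Assumption: charged once, not per year.
UPDATES_PER_YEAR = 1200

def first_year_cost(tier, include_updates=True):
    """History + one year of Security Master (+ one year of daily updates)."""
    total = HISTORY_ONE_TIME + SECURITY_MASTER_PER_YEAR[tier]
    if include_updates:
        total += UPDATES_PER_YEAR
    return total

print(first_year_cost("Team"))  # 3960 + 900 + 1200 = 6060
```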
