Macy's USA — Full Product Catalog

Product-level catalog from macys.com with per-product color lists, star ratings, and review counts. 325K+ deduplicated rows across 5,300+ brands — covering pricing, promotional tags, ratings, and images across all major departments.

✓ Active Schema: E0005r Country: USA Currency: USD Language: English Last delivered: 2026-04-15 Started: 2026-04-08 €90.00 / delivery
Scale
Total rows (deduped)
325K+
Raw rows before dedup
801K
Unique prod codes
325,712
Brands
5,338
File size
348.85 MB
Competence date
2026-04-14
Sample rows
product_codebrandproduct_titlecolor_info (all colors)full_pricepriceadditional_tagscategory2
23476271::defaultCategoryIdI.N.C. International ConceptsWomen's Menswear BlazerPink Rose Gem, Sweet Pea, Skyride, Icy Rose, Deep Black$39.75$39.75Women's Blazers
19848686::defaultCategoryIdForever Grown DiamondsLab-Grown Diamond Three Stone Ring (2 ct. t.w.)Sterling Silver, Gold-Plated Sterling Silver$1,000$19980% offRing
12469739::defaultCategoryIdEFFY CollectionEFFY Sapphire & Diamond Stud Earrings in 14k White GoldSapphire, Ruby, Emerald$3,850$1,347.5065% offEarrings
25909866::defaultCategoryIdKARL LAGERFELD PARISWomen's Placed Floral Printed Collared BlazerSoft White$209$209Women's Coats & Jackets
15956563::defaultCategoryIdI.N.C. International ConceptsWomen's Mid-Rise Bootcut PantsDeep Black, Bright White, Warm Toffee, Indigo Sea, Summer Straw$59.50$59.50Women's Pants

5 of 325,906 rows. The color_info column lists ALL colors for the product in one row — this is not a per-color row dataset.

Product identification — code structure
FieldType valueExample valueMeaning
product_code23476271::defaultCategoryIdUnique product ID. Format: numeric_id::defaultCategoryId. The "::defaultCategoryId" suffix is identical for every row — the meaningful identifier is the numeric part.
additional_code_1defaultCategoryId55429Macy's internal category ID — not a UPC or barcode. The type field is "defaultCategoryId" for all rows.
additional_code_1_typedefaultCategoryIdConstant "defaultCategoryId" across all rows
additional_content — ratings & reviews
additional_content — plain text format
stars:4.2/5    reviews:2518

Star rating and review count as plain text: stars:X.X/5 reviews:NNNN. Present for ~45.6% of products — those that have visible ratings on site. Empty for unrated products. Parse by splitting on whitespace.

color_info and variant_info — product-level color list

Both fields contain the same value: a comma-separated list of all available colors for the product in a single row. This is not a per-color dataset — one row = one product, with all its color options listed together. Example: "Deep Black, Bright White, Warm Toffee, Indigo Sea, Summer Straw". Fill rate ~87.9% — products without color variants (e.g. some home goods) have empty fields.

additional_tags — promotional labels

Promotional badge text when shown on site (~22% fill). Examples: "80% off", "65% off", "EXTRA 25% OFF". The promotion_end_date field captures the expiry date when available (~31.8% fill).

Department breakdown
Women's Fashion, Shoes & Accessories~27%
Home~27%
Jewelry & Watches~12%
Kids~6%
Beauty~6%
Men's Fashion, Shoes & Accessories~5%
Toys~5%
Electronics~4%
Other (Luggage, Juniors, Lifestyle)~8%
Field fill rates
FieldFill rateCoverage
All fields — status & notes
FieldStatusNotes
product_code
Filled
numeric_id::defaultCategoryId — unique per product
additional_code_1
Filled
Macy's internal category ID — not a UPC barcode
additional_code_1_type
Filled
"defaultCategoryId" for all rows
brand
Filled
Brand name as shown on site — 5,338 unique brands
color_info
Partial
All available colors for the product, comma-separated in one field (~87.9% fill)
variant_info
Partial
Same value as color_info (~87.9% fill)
product_title
Filled
Product name exactly as shown on site
description
Filled
Product description text
specifications
Filled
Product detail bullet points, pipe-separated
additional_content
Partial
Ratings and reviews: "stars:4.2/5 reviews:2518" (~45.6% fill — empty when no reviews)
additional_tags
Partial
Promotional badge text — e.g. "80% off", "65% off" (~22% fill)
in_stock
Filled
Stock status
category1
Filled
Top-level department (e.g. "WOMEN'S FASHION, SHOES & ACCESSORIES")
category2
Filled
Sub-category (e.g. "Women's Blazers", "Ring") — ~99% fill
category3–10
N/A on site
Macy's uses a 2-level category hierarchy only
full_price
Filled
Regular retail price (USD) before any discount
price
Filled
Actual offered price — lower than full_price when on sale
promotion_end_date
Partial
Sale expiry date when shown on site (~31.8% fill)
itemurl
Filled
Direct product page URL
main_image_url
Filled
Primary product image — never truncated
all_images
Filled
JSON array of all product images — capped at 1,000 chars
size_info
N/A on site
Not exposed at product listing level on Macy's
material_info
Not collected
Not currently collected
store_* / merchant_*
N/A on site
Single online retailer — no store or marketplace seller data
quantity / ppu / unit_type
N/A on site
Not exposed by Macy's
delivery
Not collected
Improvement opportunity
website_name
Filled
"macys.com"
competence_date
Filled
Date of collection (YYYY-MM-DD)
country_code / currency_code
Filled
"USA" / "USD"
Filled Partial fill Not collected / N/A on site
Improvement opportunities
deliveryEasyShipping info occasionally shown on PDP
promotion_typeEasyPromo label already in additional_tags — could be mirrored here for schema consistency
material_infoMediumMaterials and care info present on some PDPs
size_infoMediumAvailable sizes visible on PDP — requires size-level scraping
File delivery path
s3://databoutique.com/sellers/YOUR_SELLER_ID/YOUR_CONTRACT_ID/YYYY-MM-DD/data_file.txt
Contact
providerIfeanyi Muotoe — Web Scraping Specialist
emailifeanyi.webapp@gmail.com
githubgithub.com/Anzywiz
portfolioifeanyimuotoe.streamlit.app