Skip to content

Conversation

@jwhartley
Copy link
Contributor

Summary

  • Add _meta fields documentation explaining capture metadata (uuid, op, source fields by connector)
  • Add feature flags documentation covering materialization flags like use_existing_table, retain_existing_data_on_backfill, etc.
  • Add custom column types documentation for type overrides in materializations
  • Add troubleshooting section with schema inference issues guide
  • Fix Snowflake SQL syntax in docs

Test plan

  • Site builds successfully (existing HubSpot redirect error is unrelated)
  • Review pages render correctly in dev server

🤖 Generated with Claude Code

jwhartley and others added 4 commits February 2, 2026 14:25
- Add new Troubleshooting section under Using Estuary (guides)
- Add article: Fixing Schema Inference Issues from Bad Upstream Data
  - Documents dataflow reset, castToString, and ddl override options
- Fix Snowflake key-pair auth SQL to use identifier() function
- Add .gitleaksignore for example private key in docs

Co-Authored-By: Claude Opus 4.5 <[email protected]>
Documents connector feature flags including:
- Materialization flags (allow_existing_tables_for_new_bindings, retain_existing_data_on_backfill, datetime_keys_as_string)
- Capture schema inference flags (use_schema_inference, emit_sourced_schemas)
- Migration steps for existing captures to adopt new schema inference behavior

Co-Authored-By: Claude Opus 4.5 <[email protected]>
Explains the metadata fields that captures add to documents including
standard fields (uuid, op, source) and connector-specific source fields
for PostgreSQL, MySQL, SQL Server, Oracle, MongoDB, and Snowflake.

Co-Authored-By: Claude Opus 4.5 <[email protected]>
- Add documentation for custom column types feature
- Expand caveats for use_existing_table and retain_existing_data_on_backfill
- Clarify datetime_keys_as_string default behavior

Co-Authored-By: Claude Opus 4.5 <[email protected]>
@github-actions
Copy link

github-actions bot commented Feb 2, 2026

PR Preview Action v1.8.1

QR code for preview link

🚀 View preview at
https://estuary.github.io/flow/pr-preview/pr-2648/

Built to branch gh-pages at 2026-02-02 09:29 UTC.
Preview will be ready when the GitHub Pages deployment is complete.

jwhartley and others added 3 commits February 2, 2026 14:43
- Note that feature flags are for non-standard/complex situations
- Mention some connectors expose them in UI under Advanced settings

Co-Authored-By: Claude Opus 4.5 <[email protected]>
… Usage

Move metadata-fields, feature-flags, and custom-column-types docs to
guides/advanced-usage/ for better discoverability under Using Estuary.

Co-Authored-By: Claude Opus 4.5 <[email protected]>
@jwhartley jwhartley requested a review from aeluce February 2, 2026 04:55
@jwhartley
Copy link
Contributor Author

Hi @aeluce, I'm grouping some doc edits in this PR that I've accumulated over the last week.

I'm making a concerted effort to make docs if there's a question that has come up multiple times. I've created a new Using Estuary -> Advanced and Troubleshooting section for these new articles, but I'm not fussed about their exact location.

jwhartley and others added 2 commits February 2, 2026 15:59
- Add capture example for feature flags (not just materialization)
- Add explicit JSON paths for LLM/programmatic modification
- Show how to append to existing feature_flags string
- Clarify how to locate bindings by source collection name

Co-Authored-By: Claude Opus 4.5 <[email protected]>
Add comprehensive troubleshooting section covering:
- Write vs read schema explanation
- Auto-inference behavior (widens only)
- NULL values in new columns
- Data flow interruption diagnosis
- Transient validation errors during schema changes
- onIncompatibleSchemaChange configuration
- NoSQL type inference issues

Based on analysis of 316 schema-related support issues.

Co-Authored-By: Claude Opus 4.5 <[email protected]>
@jwhartley
Copy link
Contributor Author

Added schema evolution troubleshooting guide to site/docs/guides/schema-evolution.md covering:

  • How schema evolution works: Write vs read schemas, auto-inference behavior
  • Troubleshooting sections:
    • Why new columns show NULL values
    • Data stopped flowing after schema change (with diagnostic table)
    • Transient validation errors during schema changes
    • Configuring onIncompatibleSchemaChange
    • Type inference issues with NoSQL sources (MongoDB, etc.)

Based on analysis of 316 schema-related support issues from July-Dec 2025.

jwhartley and others added 2 commits February 2, 2026 16:55
Documents how schema inference complexity limits work (1,000 fields
for most sources, 10,000 for SQL) and how to re-add pruned fields
by adding them to the readSchema's third allOf entry.

Co-Authored-By: Claude Opus 4.5 <[email protected]>
Add 3 new troubleshooting sections to schema-evolution.md:
- Schema not yet available for new collections
- Binding not found
- Unsupported DDL operations (MySQL/MariaDB CDC)

These sections provide documentation targets for upcoming error message
improvement tickets that will add links to relevant troubleshooting docs.

Co-Authored-By: Claude Opus 4.5 <[email protected]>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants