Add and export getSchema function #363

annie · 2023-04-14T20:07:38Z

This PR adds a getSchema function, building on a suggestion from Mike. This function conditionally infers a schema depending on whether or not a valid one exists on the source, combines the schema with any saved type assertions, and determines whether or not the source data needs to be coerced.
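The conditional inference described above could look roughly like the sketch below. It is not the code in this PR: `inferSchemaSketch` and `isValidSchemaSketch` are hypothetical stand-ins, the `{name, type}` column shape is an assumption, and the type-assertion step is left out.

```javascript
// Hypothetical stand-in for the library's schema inference: guesses a type
// per column from the first row only. The real inference is more thorough.
function inferSchemaSketch(rows) {
  const first = rows[0] ?? {};
  return Object.keys(first).map((name) => ({
    name,
    type: first[name] instanceof Date ? "date" : typeof first[name]
  }));
}

// Assumed validity check: a non-empty array of {name, type} entries.
function isValidSchemaSketch(schema) {
  return (
    Array.isArray(schema) &&
    schema.length > 0 &&
    schema.every((c) => c && typeof c.name === "string" && typeof c.type === "string")
  );
}

// Returns the existing schema when it is valid; otherwise infers one and
// tells the caller that the values may need to be coerced.
function getSchemaSketch(source) {
  if (isValidSchemaSketch(source.schema)) {
    return {schema: source.schema, shouldCoerce: false};
  }
  return {schema: inferSchemaSketch(source), shouldCoerce: true};
}
```

A source that already carries a valid `.schema` passes through unchanged; anything else gets an inferred schema and the flag set.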

We will use getSchema in both __table and in the table schema tasks in the worker (I'll update https://github.com/observablehq/observablehq/pull/11496 to use this function), and thereby unify the two paths that we currently have for type inference.

I also added a missing unit test for type assertion in __table, as well as some tests for getSchema.

annie · 2023-04-14T20:19:03Z

hm on second thought, maybe we should handle type assertions outside of getSchema, because it's a pain to have to pass around operations.types in the worker.
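For illustration, handling type assertions as a separate step might look like the sketch below. The function name is hypothetical, and it assumes both the schema and `operations.types` are arrays of `{name, type}` entries, which this thread does not confirm.

```javascript
// Overrides schema column types with any asserted types.
// Assumption: schema and types are both arrays of {name, type} objects.
function applyTypeAssertionsSketch(schema, types = []) {
  const asserted = new Map(types.map(({name, type}) => [name, type]));
  // Return new column objects so the input schema is left untouched.
  return schema.map((column) =>
    asserted.has(column.name) ? {...column, type: asserted.get(column.name)} : column
  );
}
```

With this split, the worker can call getSchema without having operations.types on hand, and only __table needs to apply the assertions.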

mbostock · 2023-04-14T20:28:19Z

src/table.js

+    schema = inferSchema(source, isQueryResultSetColumns(columns) ? columns : undefined);
+    return {schema, shouldCoerce: true};
+  }
+  return {schema, shouldCoerce: false};


Maybe we should call this “inferred” instead of “shouldCoerce”? Because “inferred” makes a factual observation about how the schema was derived, whereas “shouldCoerce” implies that the caller should do something with the returned schema—and perhaps the caller wants to do something else?

good call, updated.
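To illustrate the naming point: with `inferred`, the decision to coerce sits with the caller. A minimal sketch, assuming a `{name, type}` schema shape and a hypothetical, deliberately simplistic `coerceValueSketch` helper:

```javascript
// Hypothetical coercion helper covering only two types for brevity.
function coerceValueSketch(value, type) {
  switch (type) {
    case "number":
      return value == null || value === "" ? NaN : Number(value);
    case "string":
      return value == null ? value : String(value);
    default:
      return value;
  }
}

// The caller decides what `inferred` means for it. Here, an inferred schema
// triggers coercion; a schema supplied by the source is trusted as-is.
function prepareRowsSketch(rows, {schema, inferred}) {
  if (!inferred) return rows;
  return rows.map((row) => {
    const out = {...row};
    for (const {name, type} of schema) out[name] = coerceValueSketch(row[name], type);
    return out;
  });
}
```

Another caller, such as a worker task that only reports the schema, could read the same flag and skip coercion entirely.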

mbostock · 2023-04-14T20:41:13Z

src/table.js

+  const schemaInfo = getSchema(source);
+  let {columns} = source;
+  let {schema, inferred} = schemaInfo;


Minor simplification:

Suggested change

-  const schemaInfo = getSchema(source);
-  let {columns} = source;
-  let {schema, inferred} = schemaInfo;
+  let {columns} = source;
+  let {schema, inferred} = getSchema(source);

Also: I don’t think we should fix it now, but there’s a latent bug below where we’re assuming that if source.columns is truthy, it’s a valid QueryResultSetColumns. That’s because this function is trying to preserve the same shape of output as the input; so, if the input had a source.columns, we return a result.columns. I don’t think we need to do that; we could just always return a result.schema instead, now that we’re inferring a schema as needed. But again, reasonable to do that as a separate change to minimize risk since it’s not directly related to this PR’s objective.

hm yeah, now that .schema is always defined, do we even need to return .columns from this function or can we just remove it? or would that be a breaking change, since we export __table from stdlib?

Right, I meant I think that __table can always return .schema (and never .columns). I don’t think that would be breaking… 🤷

Annie Zhang added 2 commits April 14, 2023 15:29

Add and export getSchema function

457762e

more unit tests

dca358f

annie requested review from mbostock and libbey-observable April 14, 2023 20:07

Annie Zhang added 2 commits April 14, 2023 16:26

handle type assertions outside of getSchema; simplify unit tests

3725a4f

move comment

62089f8

mbostock reviewed Apr 14, 2023

Annie Zhang added 2 commits April 14, 2023 16:37

use inferred instead of shouldCoerce; use input instead of source

3aff931

aaand update tests too

e4ca320

mbostock approved these changes Apr 14, 2023

simplification

1f12213

annie merged commit 76d8919 into main Apr 14, 2023

annie deleted the annie/get-schema branch April 14, 2023 21:46

annie mentioned this pull request Apr 20, 2023

Remove .columns #365

Merged
