1.5.0 (2024-05-07)
bigframes.optionsandbigframes.option_contextnow uses thread-local variables to prevent context managers in separate threads from affecting each other (#652) (651fd7d)- Add
ARIMAPlus.coef_property exposingML.ARIMA_COEFFICIENTSfunctionality (#585) (81d1262) - Add a unique session_id to Session and allow cleaning up sessions (#553) (c8d4e23)
- Add the
bigframes.bigquerysub-package with abigframes.bigquery.array_lengthfunction (#630) (9963f85) - Always do a query dry run when
option.repr_mode == "deferred"(#652) (651fd7d) - Custom query labels for compute options (#638) (f561799)
- Warn with
DefaultIndexWarningfromread_gbqon clustered/partitioned tables with noindex_colorfiltersset (#631, #658) (2715d2b, 73064dd) - Support
index_col=Falseinread_csvandengine="bigquery"(73064dd) - Support gcf max instance count in
remote_function(#657) (36578ab)
- Don't raise UnknownLocationWarning for US or EU multi-regions (#653) (8e4616b)
- Fix bug with na in the column labels in stack (#659) (4a34293)
- Use explicit session in
PaLM2TextGenerator(#651) (e4f13c3)
- Add python code sample for multiple forecasting time series (#531) (16866d2)
- Fix the Palm2TextGenerator output token size (#649) (c67e501)
1.4.0 (2024-04-29)
- Add .cache() method to persist intermediate dataframe (#626) (a5c94ec)
- Add transpose support for small homogeneously typed DataFrames. (#621) (054075d)
- Allow single input type in
remote_function(#641) (3aa643f) - Expose gcf max timeout in
remote_function(#639) (dfeaad0) - Series binary ops compatible with more types (#618) (518d315)
- Support the
scoremethod forPaLM2TextGenerator(#634) (3ffc1d2)
- Allow to_pandas to download more than 10GB (#637) (ce56495)
- Extend row hash to 128 bits to guarantee unique row id (#632) (9005c6e)
- Llm fine tuning tests (#627) (4724a1a)
- Llm palm score tests (#643) (cf4ec3a)
- Automatically condense internal expression representation (#516) (03c1b0d)
- Cache transpose to allow performant retranspose (#635) (44b738d)
- Add supported pandas apis on the main page (#628) (8d2a51c)
- Add the first sample for the Single time-series forecasting from Google Analytics data tutorial (#623) (2b84c4f)
- Address more technical writers' feedback (#640) (1e7793c)
1.3.0 (2024-04-22)
- Add
Series.struct.dtypesproperty (#599) (d924ec2) - Add fine tuning
fit()for Palm2TextGenerator (#616) (9c106bd) - Add quantile statistic (#613) (bc82804)
- Expose
max_batching_rowsinremote_function(#622) (240a1ac) - Support primary key(s) in
read_gbqby using as theindex_colby default (#625) (75bb240) - Warn if location is set to unknown location (#609) (3706b4f)
- Address technical writers fb (#611) (9f8f181)
- Infer narrowest numeric type when combining numeric columns (#602) (8f9ece6)
- Use exact median implementation by default (#619) (9d205ae)
- Fix rendering of examples for multiple apis (#620) (9665e39)
- Set
index_colsinread_gbqas a best practice (#624) (70015b7)
1.2.0 (2024-04-15)
- Add hasnans, combine_first, update to Series (#600) (86e0f38)
- Add MultiIndex subclass. (#596) (5d0f149)
- Add pivot_table for DataFrame. (#473) (5f1d670)
- Add Series.autocorr (#605) (4ec8034)
- Support list of numerics in pandas.cut (#580) (290f95d)
- Address more technical writers feedback (#581) (4b08d92)
- Error for object dtype on read_pandas (#570) (8702dcf)
- Inverting int now does bitwise inversion rather than sign flip (#574) (5f1db8b)
- Loc setitem dtype issue. (#603) (b94bae9)
- Toc menu missing plotting name (#591) (eed12c1)
- (Series|Dataframe).dtypes (#598) (edef48f)
- Add code samples for
straccessor methdos (#594) (a557ea2) - Add docs for
DataFrameandSeriesdunder methods (#562) (8fc26c4) - Add examples for at/iat (#582) (3be4a2e)
1.1.0 (2024-04-04)
- (Series|DataFrame).explode (#556) (9e32f57)
- Add
DataFrame.evalandDataFrame.query(#361) (5e28ebd) - Add ColumnTransformer save/load (#541) (9d8cf67)
- Add ml.metrics.mean_squared_error (#559) (853c25e)
- Add support for numpy expm1, log1p, floor, ceil, arctan2 ops (#505) (e8e66cf)
- Add transformers save/load (#552) (d805241)
- Allow DataFrame binary ops to align on either axis and with loc… (#544) (6d8f3af)
- Expose
DataFrame.bqclientto assist in integrations (#519) (0be8911) - Read_pandas accepts pandas Series and Index objects (#573) (f8821fe)
- Support
ML.GENERATE_EMBEDDINGinPaLM2TextEmbeddingGenerator(#539) (1156c1e) - Support max_columns in repr and make repr more efficient (#515) (54e49cf)
- Assign NaN scalar to column error. (#513) (0a4153c)
- Don't download 100gb onto local python machine in load test (#537) (082c58b)
- Exclude list-like s parameter in plot.scatter (#568) (1caac27)
- Fix case where df.peek would fail to execute even with force=True (#511) (8eca99a)
- Fix error in
Series.drop(0)(#575) (75dd786) - Include all names in MultiIndex repr (#564) (b188146)
- Plot.scatter s parameter cannot accept float-like column (#563) (8d39187)
- Product operation produces float result for all input types (#501) (6873b30)
- Reloaded transformer .transform error (#569) (39fe474)
- Rename PaLM2TextEmbeddingGenerator.predict output columns to be backward compatible (#561) (4995c00)
- Respect hard stack size limit and swallow limit change exception. (#558) (4833908)
- Restore string to date/time type coercion (#565) (4ae0262)
- Sync the notebook with embedding changes (#550) (347f2dd)
- Use bytes limit on frame inlining rather than element count (#576) (659a161)
bigframes.options.bigquery.projectandlocationare optional in some circumstances (#548) (90bcec5)- Add "Supported pandas APIs" reference to the documentation (#542) (74c3915)
- Add General Availability banner to README (#507) (262ff59)
- Add opeartions in API docs (#557) (ea95761)
- Add progress_bar code sample (#508) (92a1af3)
- Add the code samples for metrics{auc, roc_auc_score, roc_curve} (#520) (5f37b09)
- Address more comments from technical writers to meet legal purposes (#571) (9084df3)
- Fix docs of ARIMAPlus.predict (#512) (3b80f95)
- Include Index in table-of-contents (#564) (b188146)
- Mark Gemini model as Pre-GA (#543) (769868b)
- Migrate the overview page to Bigframes official landing page (#536) (a0fb8bb)
1.0.0 (2024-03-25)
- rename model parameter
min_rel_progresstotol early_stopsetting no longer supported, always usesTrue- rename model parameter
n_parallell_treeston_estimators - rename
class_weightstoclass_weight - rename
learn_ratetolearning_rate - PCA
n_componentssupports float value andNone, default toNone - rename various ml model parameters for consistency with sklearn (googleapis#491)
- Add configuration option to read_gbq (#401) (85cede2)
- Add ml ARIMAPlus model params (#488) (352cb85)
- Add ml KMeans model params (#477) (23a8d9a)
- Add ml LogisticRegression model params (#481) (f959b65)
- Add ml PCA model params (#474) (fb5d83b)
- Add params for LinearRegression model (#464) (21b2188)
- Add support for Python 3.12 (#231) (df2976f)
- Allow assigning directly to Series.name property (#495) (ad0e99e)
- Ensure
Series.str.len()can get length of array columns (#497) (10c0446) - Option to use bq connection without check (#460) (0b3f8e5)
- PCA
n_componentssupports float value andNone, default toNone(65c6f47) - Rename
class_weightstoclass_weight(65c6f47) - Rename
learn_ratetolearning_rate(65c6f47) - Rename model parameter
min_rel_progresstotol(65c6f47) - Rename model parameter
n_parallell_treeston_estimators(65c6f47) - Rename various ml model parameters for consistency with sklearn (googleapis#491) (65c6f47)
- Support BQ regional endpoints for europe-west9, europe-west3, us-east4, and us-west1 (#504) (fbada4a)
- Support dataframe.cov (#498) (c4beafd)
- Support Series.dt.floor (#493) (2dd01c2)
- Support Series.dt.normalize (#483) (0bf1e91)
- Update plot sample to 1000 rows (#458) (60d4a7b)
early_stopsetting no longer supported, always usesTrue(65c6f47)- Fix -1 offset lookups failing (#463) (2dfb9c2)
- Plot.scatter
cargument functionalities (#494) (d6ee994) - Properly support format param for numerical input. (#486) (ae20c35)
- Renable to_csv and to_json related tests (#468) (2b9a01d)
- Sampling plot cannot preserve ordering if index is not ordered (#475) (a5345fe)
- Use actual BigQuery types rather than ibis types in to_pandas (#500) (82b4f91)
- Add code samples for metrics.{accuracy_score, confusion_matrix} (#478) (3e3329a)
- Add code samples for metrics.{recall_score, precision_score, f11_score} (#502) (370fe90)
- Improve API documentation (#489) (751266e)
- Update bigquery connection documentation (#499) (4bfe094)
- Update LLM + K-means notebook to handle partial failures (#496) (97afad9)
0.26.0 (2024-03-20)
- exclude remote models for .register() (#465)
- (Series|DataFrame).plot (#438) (1c3e668)
read_gbq_tablesupportsLIKEas a operator infilters(#454) (d2d425a)- Add DataFrame.pipe() method (#421) (95f5a6e)
- Set
force=Trueby default inDataFrame.peek()(#469) (4e8e97d) - Support datetime related casting in (Series|DataFrame|Index).astype (#442) (fde339b)
- Support Series.dt.strftime (#453) (8f6e955)
- Any() on empty set now correctly returns False (#471) (f55680c)
- Df.drop_na preserves columns dtype (#457) (3bab1a9)
- Disable to_json and to_csv related tests (#462) (874026d)
- Exclude remote models for .register() (#465) (73fe0f8)
- Fix broken link in covid notebook (#450) (adadb06)
- Fix broken multiindex loc cases (#467) (b519197)
- Fix grouping series on multiple other series (#455) (3971bd2)
- Groupby aggregates no longer check if grouping keys are numeric (#472) (4fbf938)
- Raise
ValueErrorwhenread_pandas()receives a bigframesDataFrame(#447) (b28f9fd) - Series.(to_csv|to_json) leverages bq export (#452) (718a00c)
- Warn when
read_gbq/read_gbq_tableuses the snapshot time cache (#441) (e16a8c0)
- Add code samples for
ml.metrics.r2_score(#459) (85fefa2) - Add the docs for loc and iloc indexers (#446) (14ab8d8)
- Add the pages for at and iat indexers (#456) (340f0b5)
- Add version information to bug template (#437) (91bd39e)
- Indicate that project and location are optional in example notebooks (#451) (1df0140)
0.25.0 (2024-03-14)
- (Series|DataFrame).plot.(line|area|scatter) (#431) (0772510)
- Support CMEK for
remote_functioncloud functions (#430) (2fd69f4)
0.24.0 (2024-03-12)
read_parquetuses a "pandas" engine to parse files by default. Useengine="bigquery"for the previous behavior
- (Series|Dataframe).plot.hist() (#420) (4aadff4)
- Add detect_anomalies to ml ARIMAPlus and KMeans models (#426) (6df28ed)
- Add engine parameter to
read_parquet(#413) (31325a1) - Add ml PCA.detect_anomalies method (#422) (8d82945)
- Support BYOSA in
remote_function(#407) (d92ced2) - Support CMEK for BQ tables (#403) (9a678e3)
- Move
third_party.bigframes_vendoredtobigframes_vendored(#424) (763edeb) - Only do row identity based joins when joining by index (#356) (76b252f)
- Read_pandas inline respects location (#412) (ae0e3ea)
- Add predict sample to samples/snippets/bqml_getting_started_test.py (#388) (6a3b0cc)
- Document minimum IAM requirement (#416) (36173b0)
- Fix the note rendering for DataFrames methods: nlargest, nsmallest (#417) (38bd2ba)
0.23.0 (2024-03-05)
- Add ml.metrics.pairwise.euclidean_distance (#397) (1726588)
- Add TextEmbedding model version support (#394) (e0f1ab0)
- Code exception in
remote_functionnow prevents retry and surfaces in the client (#387) (dd3643d) - Docs link for metrics.pairwise (#400) (a60aba7)
0.22.0 (2024-02-27)
- rename cosine_similarity to paired_cosine_distances (#393)
- move model optional args to kwargs (#381)
- Add
DataFrames.corr()method (#379) (67fd434) - Add ml.metrics.pairwise.manhattan_distance (#392) (9d31865)
- Enable regional endpoints for me-central2 (#386) (469674d)
- Avoid ibis warning for "database" table() method argument (#390) (a0490a4)
- Correct the numeric literal dtype (#365) (93b02cd)
- Rename cosine_similarity to paired_cosine_distances (#393) (81ece46)
- Add a code sample for creating a kmeans model (#267) (4291d65)
- Fix
bigframes.pandas.concatdocumentation (#382) (234b61c)
0.21.0 (2024-02-13)
- Add
Series.covmethod (#368) (443db22) - Add ml.llm.GeminiTextGenerator model (#370) (de1e0a4)
- Add ml.metrics.pairwise.cosine_similarity function (#374) (126f566)
- Add XGBoostModel (#363) (d5518b2)
- Limited support of lambdas in
Series.apply(#345) (208e081) - Support bigframes.pandas.to_datetime for scalars, iterables and series. (#372) (ffb0d15)
- Support read_gbq wildcard table path (#377) (90caf86)
0.20.1 (2024-02-06)
- Add a sample to demonstrate the evaluation results (#364) (cff0919)
- Fix the
DataFrame.applycode sample (#366) (1866a26)
0.20.0 (2024-01-30)
- Add
DataFrame.peek()as an efficient alternative tohead()results preview (#318) (9c34d83) - Add ARIMA_EVAULATE options in forecasting models (#336) (73e997b)
- Add Index constructor, repr, copy, get_level_values, to_series (#334) (e5d054e)
- Improve error message for drive based BQ table reads (#344) (0794788)
- Update cut to work without labels = False and show intervals as dict (#335) (4ff53db)
- Chance default connection name in getting_started.ipnyb (#347) (677f014)
- Series iteration correctly returns values instead of index (#339) (2c6af9b)
0.19.2 (2024-01-22)
- Read_gbq large response issue (#332) (b8178b9)
- Use object dtype for ARRAY columns in
to_pandas()with pandas 1.x (#329) (374ddb5)
- Add
DataFrame.applymapdocumentation (#326) (bd531a1) - Add code samples for series methods (#323) (32cc6fa)
- Add remote model requirements (#333) (c91f70c)
0.19.1 (2024-01-17)
- Handle multi-level columns for df aggregates properly (#305) (5bb45ba)
- Update max_output_token limitation. (#308) (5cccd36)
0.19.0 (2024-01-09)
- Add 'columns' as an alias for 'col_order' (#298) (a01b271)
- Add Series dt.tz and dt.unit properties (#303) (2e1a403)
- Add to_gbq() method for LLM models (#299) (dafbc1b)
- Allow manually set clustering_columns in dataframe.to_gbq (#302) (9c21323)
- Support assigning to columns like a property (#304) (f645c56)
- Support upcasting numeric columns in concat (#294) (e3a056a)
- DF.drop tuple input as multi-index (#301) (21391a9)
- Fix bug converting non-string labels to sql ids (#296) (a61c5fe)
0.18.0 (2024-01-02)
- Add dataframe.to_html (#259) (2cd6489)
- Add IntervalIndex support to bigframes.pandas.cut (#254) (6c1969a)
- Add replace method to DataFrame (#261) (5092215)
- Specific pyarrow mappings for decimal, bytes types (#283) (a1c0631)
- Dataframes to_gbq now creates dataset if it doesn't exist (#222) (bac62f7)
- Exclude pandas 2.2.0rc0 to unblock prerelease tests (#292) (ac1a745)
- Fix DataFrameGroupby.agg() issue with as_index=False (#273) (ab49350)
- Make
Series.str.replacework for simple strings (#285) (ad67465) - Update dataframe.to_gbq to dedup column names. (#286) (746115d)
- Use setuptools.find_namespace_packages (#246) (9ec352a)
- Add code snippets for explore query result page (#278) (7cbbb7d)
- Code samples for
astypecommon to DataFrame and Series (#280) (95b673a) - Code samples for
DataFrame.copyandSeries.copy(#290) (7cbc2b0) - Code samples for
dropandfillna(#284) (9c5012e) - Code samples for
isna,isnull,dropna,isin(#289) (ad51035) - Code samples for
rename,size(#293) (eb69f60) - Code samples for
reset_indexandsort_values(#282) (acc0eb7) - Code samples for
sample,get,Series.round(#295) (c2b1892) - Code samples for
Series.{add, replace, unique, T, transpose}(#287) (0e1bbfc) - Code samples for
Series.{map, to_list, count}(#290) (7cbc2b0) - Code samples for
Series.{name, std, agg}(#293) (eb69f60) - Code samples for
Series.groupbyandSeries.{sum,mean,min,max}(#280) (95b673a) - Code samples for DataFrame
set_index,items(#295) (c2b1892) - Fix the rendering for
get_dummies(#291) (252f3a2)
0.17.0 (2023-12-14)
- Add
filtersargument toread_gbqfor enhanced data querying (#198) (034f71f) - Add module/class level api tracking (#272) (4f3db3d)
- Deprecate
use_regional_endpoints(#199) (319a1f2)
- Increase recursion limit, cache compilation tree hashes (#184) (b54791c)
- Replaced raise
NotImplementedErrorwith returnNotImplemented(#258) (a133822)
- Add code samples for
valuesandvalue_counts(#249) (f247d95) - Add sample for getting started with BQML (#141) (fb14f54)
0.16.0 (2023-12-12)
- Add ARIMAPlus.predict parameters (#264) (99598c7)
- Add DataFrame from_dict and from_records methods (#244) (8d81e24)
- Add DataFrame.select_dtypes method (#242) (1737acc)
- Add nunique method to Series/DataFrameGroupby (#256) (c8ec245)
- Support dataframe.loc with conditional columns selection (#233) (3febea9)
- Enfore pandas version requirement <2.1.4 (#265) (9dd63f6)
- Exclude pandas 2.1.4 from prerelease tests to unblock e2e tests (b02fc2c)
- Fix value_counts column label for normalize=True (#245) (d3fa6f2)
- Migrate e2e tests to bigframes-load-testing project (8766ac6)
- Ml.sql logic (#262) (68c6fdf)
- Update the llm_kmeans notebook (#247) (66d1839)
- Add code samples for
shapeandhead(#257) (5bdcc65) - Add example for dataframe.melt, dataframe.pivot, dataframe.stac… (#252) (8c63697)
- Add example to dataframe.nlargest, dataframe.nsmallest, datafra… (#234) (e735412)
- Add examples for dataframe.cummin, dataframe.cummax, dataframe.cumsum, dataframe.cumprod (#243) (0523a31)
- Add examples for dataframe.nunique, dataframe.diff, dataframe.a… (#251) (77074ec)
- Correct the docs for
option_context(#263) (d21c6dd) - Correct the params rendering for
ml.remoteandml.ensemblemodules (#248) (c2829e3) - Fix return annotation in API docstrings (#253) (89a1c67)
0.15.0 (2023-11-29)
- model.predict returns all the columns (#204)
- Add info and memory_usage methods to dataframe (#219) (9d6613d)
- Add remote vertex model support (#237) (0bfc4fb)
- Add the recent api method for ML component (#225) (ed8876d)
- Model.predict returns all the columns (#204) (416171a)
- Send warnings on LLM prediction partial failures (#216) (81125f9)
- Add df snapshots lookup for
read_gbq(#229) (d0d9b84) - Avoid unnecessary row_number() on sort key for io (#211) (a18d40e)
- Dedup special character (#209) (dd78acb)
- Invalid JSON type of the notebook (#215) (a729831)
- Make to_pandas override enable_downsampling when sampling_method is manually set. (#200) (ae03756)
- Polish the llm+kmeans notebook (#208) (e8532b1)
- Update the llm+kmeans notebook with recent change (#236) (f8917ab)
- Use anonymous dataset to create
remote_function(#205) (69b016e)
- Add code samples for
indexandcolumnproperties (#212) (c88d38e) - Add code samples for df reshaping, function, merge, and join methods (#203) (010486c)
- Add examples for dataframe.kurt, dataframe.std, dataframe.count (#232) (f9c6e72)
- Add examples for dataframe.mean, dataframe.median, dataframe.va… (#228) (edd0522)
- Add examples for dataframe.min, dataframe.max and dataframe.sum (#227) (3a375e8)
- Code samples for
Series.dotandDataFrame.dot(#226) (b62a07a) - Code samples for
Series.whereandSeries.mask(#217) (52dfad2) - Code samples for dataframe.any, dataframe.all and dataframe.prod (#223) (d7957fa)
- Make the code samples reflect default bq connection usage (#206) (71844b0)
0.14.1 (2023-11-16)
0.14.0 (2023-11-14)
- Add 'cross' join support (#176) (765446a)
- Add 'index', 'pad', 'nearest' interpolate methods (#162) (6a28403)
- Add series.sample (identical to existing dataframe.sample) (#187) (37914a4)
- Add unordered sql compilation (#156) (58f420c)
- Log most recent API calls as
recent-bigframes-api-xxlabels on BigQuery jobs (#145) (4ea33b7) - Read_gbq creates order deterministically without table copy (#191) (8ab81de)
- Support
date_series.astype("string[pyarrow]")to cast DATE to STRING (#186) (aee0e8e) - Support
series.at[row_label] = scalar(#173) (0c8bd33) - Temporary resources no longer use BigQuery Sessions (#194) (4a02cac)
- All sort operation are now stable (#195) (3a2761f)
- Default to 7 days expiration for
read_csv,read_json,read_parquet(#193) (03606cd) - Deprecate the
remote_service_typein llm model (#180) (a8a409a) - For reset_index on unnamed multiindex, always use level_[n] label (#182) (f95000d)
- Match pandas behavior when assigning listlike to empty dfs (#172) (c1d1f42)
- Use anonymous dataset instead of session dataset for temp tables (#181) (800d44e)
- Use random table for
read_pandas(#192) (741c75e) - Use random table when loading data for
read_csv,read_json,read_parquet(#175) (9d2e6dc)
- Add code samples for
read_gbq_functionusing community UDFs (#188) (7506eab) - Add docstring code samples for
Series.applyandDataFrame.map(#185) (c816d84) - Add llm kmeans notebook as an included example (#177) (d49ae42)
- Use
head()to get topnresults, not to preview results (#190) (87f84c9)
0.13.0 (2023-11-07)
to_gbqwithout a destination table writes to a temporary table (#158) (e1817c9)- Add
DataFrame.__iter__,DataFrame.iterrows,DataFrame.itertuples, andDataFrame.keysmethods (#164) (c065071) - Add
Series.__iter__method (#164) (c065071) - Add interpolate() to series and dataframe (#157) (b9cb55c)
- Support 32k text-generation and multilingual embedding models (#161) (5f0ea37)
0.12.0 (2023-11-01)
- Add
DataFrame.melt(#113) (4e4409c) - Add
DataFrame.to_pandas_batches()to download largeDataFrameobjects (#136) (3afd4a3) - Add bigframes.options.compute.maximum_bytes_billed option that sets maximum bytes billed on query jobs (#133) (63c7919)
- Add pandas.qcut (#104) (8e44518)
- Add pd.get_dummies (#149) (d8baad5)
- Add unstack to series, add level param (#115) (5edcd19)
- Implement operator
@forDataFrame.dot(#139) (79a638e) - Populate ibis version in user agent (#140) (c639a36)
- Don't override the global logging config (#138) (2ddbf74)
- Fix bug with column names under repeated column assignment (#150) (29032d0)
- Resolve plotly rendering issue by using ipython html for job pro… (#134) (39df43e)
- Use indexee's session for loc listlike cases (#152) (27c5725)
- Add artithmetic df sample code (#153) (ac44ccd)
- Fix indentation on
read_gbq_functioncode sample (#163) (0801d96) - Link to ML.EVALUATE BQML page for score() methods (#137) (45c617f)
0.11.0 (2023-10-26)
- Add back
reset_sessionas an alias forclose_session(#124) (694a85a) - Change
queryparameter toquery_or_tableinread_gbq(#127) (f9bb3c4)
- Expose
bigframes.pandas.reset_sessionas a public API (#128) (b17e1f4) - Use series's own session in series.reindex listlike case (#135) (95bff3f)
- Add runnable code samples for DataFrames I/O methods and property (#129) (6fea8ef)
- Add runnable code samples for reading methods (#125) (a669919)
0.10.0 (2023-10-19)
0.9.0 (2023-10-18)
- rename
bigframes.pandas.reset_sessiontoclose_session(#101)
- Add
bigframes.options.bigquery.application_namefor partner attribution (#117) (52d64ff) - Add AtIndexer getitems (#107) (752b01f)
- Rename
bigframes.pandas.reset_sessiontoclose_session(#101) (36693bf) - Send BigQuery cancel request when canceling bigframes process (#103) (e325fbb)
- Support external packages in
remote_function(#98) (ec10c4a) - Use ArrowDtype for STRUCT columns in
to_pandas(#85) (9238fad)
- Add documentation for
Series.struct.fieldandSeries.struct.explode(#114) (a6dab9c) - Add open-source link in API doc (#106) (db51fe3)
- Update ML overview API doc (#105) (1b3f3a5)
0.8.0 (2023-10-12)
- The default behavior of
to_parquetis changing from no compression to'snappy'compression.
- Support compression in
to_parquet(a8c286f)
0.7.0 (2023-10-11)
- Add aliases for several series properties (#80) (c0efec8)
- Add equals methods to series/dataframe (#76) (636a209)
- Add iat and iloc accessing by tuples of integers (#90) (228aeba)
- Add level param to DataFrame.stack (#88) (97b8bec)
- Allow df.drop to take an index object (#68) (740c451)
- Use default session connection (#87) (4ae4ef9)
0.6.0 (2023-10-04)
- Add df.unstack (#63) (4a84714)
- Add idxmin, idxmax to series, dataframe (#74) (781307e)
- Add ml.preprocessing.KBinsDiscretizer (#81) (24c6256)
- Add multi-column dataframe merge (#73) (c9fa85c)
- Add update and align methods to dataframe (#57) (bf050cf)
- Support STRUCT data type with
Series.struct.fieldto extract child fields (#71) (17afac9)
- Avoid
403 response too large to returnerror withread_gbqand large query results (#77) (8f3b5b2) - Change return type of
Series.loc[scalar](#40) (fff3d45) - Fix df/series.iloc by list with multiindex (#79) (971d091)
0.5.0 (2023-09-28)
- Add
DataFrame.kurtosis/DF.kurtmethod (c1900c2) - Add
DataFrame.rollingandDataFrame.expandingmethods (c1900c2) - Add
items,applymethods toDataFrame. (#43) (3adc1b3) - Add axis param to simple df aggregations (#52) (9cf9972)
- Add index
dtype,astype,drop,fillna, aggregate attributes. (#38) (1a254a4) - Add ml.preprocessing.LabelEncoder (#50) (2510461)
- Add ml.preprocessing.MaxAbsScaler (#56) (14b262b)
- Add ml.preprocessing.MinMaxScaler (#64) (392113b)
- Add more index methods (#54) (a6e32aa)
- Support
calculate_p_valuesparameter inbigframes.ml.linear_model.LinearRegression(c1900c2) - Support
class_weights="balanced"inLogisticRegressionmodel (c1900c2) - Support
df[column_name] = df_only_one_column(c1900c2) - Support
early_stopparameter inbigframes.ml.linear_model.LinearRegression(c1900c2) - Support
enable_global_explainparameter inbigframes.ml.linear_model.LinearRegression(c1900c2) - Support
l2_regparameter inbigframes.ml.linear_model.LinearRegression(c1900c2) - Support
learn_rate_strategyparameter inbigframes.ml.linear_model.LinearRegression(c1900c2) - Support
ls_init_learn_rateparameter inbigframes.ml.linear_model.LinearRegression(c1900c2) - Support
max_iterationsparameter inbigframes.ml.linear_model.LinearRegression(c1900c2) - Support
min_rel_progressparameter inbigframes.ml.linear_model.LinearRegression(c1900c2) - Support
optimize_strategyparameter inbigframes.ml.linear_model.LinearRegression(c1900c2) - Support casting string to integer or float (#59) (3502f83)
- Fix header skipping logic in
read_csv(#49) (d56258c) - Generate unique ids on join to avoid id collisions (#65) (7ab65e8)
- LabelEncoder params consistent with Sklearn (#60) (632caec)
- Loosen filter items tests to accomodate shifting pandas impl (#41) (edabdbb)
- Add ability to cache dataframe and series to session table (#51) (416d7cb)
- Inline small
SeriesandDataFramesin query text (#45) (5e199ec) - Reimplement unpivot to use cross join rather than union (#47) (f9a93ce)
- Simplify join order to use multiple order keys instead of string. (#36) (5056da6)
- Link to Remote Functions code samples from README and API reference (c1900c2)
0.4.0 (2023-09-16)
- Add
axisparameter todroplevelandreorder_levels(7c6b0dd) - Add
bfillandffilltoDataFrameandSeries(7c6b0dd) - Add
DataFrame.combineandDataFrame.combine_first(#27) (7c6b0dd) - Add
DataFrame.nlargest,nsmallest(7c6b0dd) - Add
DataFrame.pct_changeandSeries.pct_change(7c6b0dd) - Add
DataFrame.skewandGroupBy.skew(7c6b0dd) - Add
DataFrame.to_dict,to_excel,to_latex,to_records,to_string,to_markdown,to_pickle,to_orc(7c6b0dd) - Add
diffmethod toDataFrameandGroupBy(7c6b0dd) - Add
filterandreindextoSeriesandDataFrame(7c6b0dd) - Add
reindex_liketoDataFrameandSeries(7c6b0dd) - Add
swapleveltoDataFrameandSeries(7c6b0dd) - Add partial support for
Sereies.replace(7c6b0dd) - Support
DataFrame.loc[bool_series, column] = scalar(7c6b0dd) - Support a persistent
nameinremote_function(7c6b0dd)
remote_functionuses same credentials as other APIs (7c6b0dd)- Add type hints to models (7c6b0dd)
- Raise error when ARIMAPlus is used with Pipeline (7c6b0dd)
- Remove
transformsparameter inmodel.fit(breaking change) (7c6b0dd) - Support column joins with "None indexer" (7c6b0dd)
- Use for literals
Int64Dtypeincut(7c6b0dd) - Use lowercase strings for parameter literals in
bigframes.ml(breaking change) (7c6b0dd)
bigframes-apilabel to I/O query jobs (7c6b0dd)
- Document possible parameter values for PaLM2TextGenerator (7c6b0dd)
- Document region logic in README (7c6b0dd)
- Fix OneHotEncoder sample (7c6b0dd)
0.3.2 (2023-09-06)
0.3.1 (2023-09-05)
0.3.0 (2023-09-02)
- Add
bigframes.get_global_session()andbigframes.reset_session()aliases (a32b747) - Add
bigframes.pandas.read_picklefunction (a32b747) - Add
components_,explained_variance_, andexplained_variance_ratio_properties tobigframes.ml.decomposition.PCA(89b9503) - Add
fit_transformtobigquery.mltransformers (a32b747) - Add
Series.dropnaandDataFrame.fillna(8fab755) - Add
Series.strmethodsisalpha,isdigit,isdecimal,isalnum,isspace,islower,isupper,zfill,center(a32b747) - Support
bigframes.pandas.merge()(8fab755) - Support
DataFrame.isinwith list and dict inputs (8fab755) - Support
DataFrame.pivot(a32b747) - Support
DataFrame.stack(89b9503) - Support
DataFrame-DataFramebinary operations (8fab755) - Support
df[my_column] = [a python list](89b9503) - Support
Index.is_monotonic(8fab755) - Support
np.arcsin,np.arccos,np.arctan,np.sinh,np.cosh,np.tanh,np.arcsinh,np.arccosh,np.arctanh,np.expwith Series argument (89b9503) - Support
np.sin,np.cos,np.tan,np.log,np.log10,np.sqrt,np.abswith Series argument (89b9503) - Support
pow()and power operator inDataFrameandSeries(8fab755) - Support
read_jsonwithengine=bigqueryfor newline-delimited JSON files (89b9503) - Support
Series.corr(89b9503) - Support
Series.map(8fab755) - Support for
np.add,np.subtract,np.multiply,np.divide,np.power(8fab755) - Support MultiIndex for DataFrame columns (a32b747)
- Use
pandas.Indexfor column labels (a32b747) - Use default session and connection in
ml.llmandml.imported(8fab755)
- Add error message to
set_index(a32b747) - Align column names with pandas in
DataFrame.aggresults (89b9503) - Allow (but still not recommended)
ORDER BYinread_gbqinput when anindex_colis defined (89b9503) - Check for IAM role on the BigQuery connection when initializing a
remote_function(89b9503) - Check that types are specified in
read_gbq_function(a32b747) - Don't use query cache for Session construction (a32b747)
- Include survey link in abstract
NotImplementedErrorexception messages (89b9503) - Label temp table creation jobs with
source=bigquery-dataframes-templabel (89b9503) - Make
X_trainargument names consistent across methods (8fab755) - Raise AttributeError for unimplemented pandas methods (89b9503)
- Raise exception for invalid function in
read_gbq_function(a32b747) - Support spaces in column names in
DataFrameinitializater (89b9503)
- Add local cache for
__repr_*__methods (a32b747) - Lazily instantiate client library objects (89b9503)
- Use
row_number()filter forhead/tail(8fab755)
- Add ML section under Overview (a32b747)
- Add release status to table of contents (a32b747)
- Add samples and best practices to
read_gbqdocs (a32b747) - Correct the return types of Dataframe and Series (a32b747)
- Create subfolders for notebooks (a32b747)
- Fix link to GitHub (89b9503)
- Highlight bigframes is open-source (a32b747)
- Sample ML Drug Name Generation notebook (a32b747)
- Set
options.bigquery.projectin sample code (89b9503) - Transform remote function user guide into sample code (a32b747)
- Update remote function notebook with read_gbq_function usage (8fab755)
- Add KMeans.cluster_centers_.
- Allow column labels to be any type handled by bq df, column labels can be integers now.
- Add dataframegroupby.agg().
- Add Series Property is_monotonic_increasing and is_monotonic_decreasing.
- Add match, fullmatch, get, pad str methods.
- Add series isin function.
- Update ML package to use sessions for queries.
- Optimize
read_gbqwithindex_colset to cluster byindex_col. - Raise ValueError if the location mismatched.
read_gbqno longer uses 'time travel' with query inputs.
- Add docstring to _uniform_sampling to avoid user using it.
- Correct link to code repository in
setup.pyand use correct terminology forconsole.cloud.google.comlinks.
- Add
bigframes.pandaspackage with an API compatible with pandas. Supported data sources include: BigQuery SQL queries, BigQuery tables, CSV (local and GCS), Parquet (local and Cloud Storage), and more. - Add
bigframes.mlpackage with an API inspired by scikit-learn. Train machine learning models and run batch predicition, powered by BigQuery ML.
0.0.0 (2023-02-22)
- Empty package to reserve package name.