This is a Python library that binds to DataFusion, the Apache Arrow in-memory
query engine.
DataFusion's Python bindings can be used as a foundation for building new data
systems in Python. Here are some examples:
- Dask SQL uses DataFusion's Python bindings for SQL parsing, query planning,
and logical plan optimizations, and then transpiles the logical plan to Dask
operations for execution.
- DataFusion Ballista is a distributed SQL query engine that extends
DataFusion's Python bindings for distributed use cases.
- DataFusion Ray is another distributed query engine that uses DataFusion's
Python bindings.
OCSP_USE_PYTHON helper have no effect, although the dependencies
are installed anyway due to other dependencies. However, adding
'cryptography' to OCSP_RUN_DEPENDS does have an effect.
Approved by: acm (mentor)
lfcbase:
- Fixed BigDecimal class: the internal constructor now normalizes
zero values to "0" and the scaleTo method treats zero values
correctly
- Added class Replacer to support string replacement based
on regular expressions ( like unix sed )
- Fixed BigDecimal, scale handling still not correct for some cases
- Replacer: The match token has been changed to $ char to treat
database integration
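To illustrate what sed-like, regular-expression-based replacement means, here is a minimal sketch using Python's standard re module. It is not the lfcbase Replacer API: the function name sed_substitute and its parameters are hypothetical, and the back-reference syntax shown is Python's, not the match token that Replacer uses.

```python
import re

def sed_substitute(pattern, replacement, text, replace_all=True):
    """Rough analogue of sed's s/pattern/replacement/ (with or without the g flag).

    Hypothetical helper for illustration only; not part of lfcbase.
    """
    # re.sub treats count=0 as "replace every match"; count=1 stops after the first.
    count = 0 if replace_all else 1
    return re.sub(pattern, replacement, text, count=count)

# Swap the two numbers around each dash, first everywhere, then only once.
swapped_all = sed_substitute(r"(\d+)-(\d+)", r"\2-\1", "10-20 30-40")
swapped_first = sed_substitute(r"(\d+)-(\d+)", r"\2-\1", "10-20 30-40", replace_all=False)
```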
cego:
- Performance optimization added in CegoSelect and CegoDistCursor
for or-condition handling:
If an or-condition is recognized, the predicate is propagated
to the corresponding join level cursor and the evaluation is handled
by the cursor.
This can avoid large table cursor result sets for the embedding
join and might improve overall performance of the query.
- For the newly introduced query optimization, several code extensions
have been added. In CegoPredicate, new methods map, getReduced,
mapExpr and reduceExpr have been added to convert the "master"
predicate to an appropriate form, which can be handled by CegoDistCursor.
Also for CegoDistCursor, the distSetup methods have been expanded
to propagate the master predicate
- Added fix in CegoAction for the following query:
select invid from invoice where status = ( select 'OPEN' );
In CegoAction::noFromOption, an empty coList has to be pushed onto _coListStack
- Added verify010 db check with a small invoice ER model. This is
used for advanced query verification with multilevel nested views
and other dynamic constructs, which is also relevant to prove valid
results from the query cache
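The or-condition optimization described above can be pictured with a small sketch. This is not Cego's implementation: the table contents, the table_cursor and nested_loop_join helpers, and the predicate are all hypothetical. It only shows why evaluating the predicate at the cursor level shrinks the row set that the join has to process while leaving the result unchanged.

```python
def table_cursor(rows, pred=None):
    """Yield rows from a table; if a predicate was pushed down, filter here."""
    for row in rows:
        if pred is None or pred(row):
            yield row

def nested_loop_join(left, right_rows, on):
    """Plain nested-loop join over already materialized right-hand rows."""
    return [(l, r) for l in left for r in right_rows if on(l, r)]

def is_open_or_hold(inv):
    # The or-condition from the WHERE clause (hypothetical example).
    return inv["status"] == "OPEN" or inv["status"] == "HOLD"

def same_customer(cust, inv):
    return cust["cid"] == inv["cid"]

customers = [{"cid": 1}, {"cid": 2}]
invoices = [
    {"invid": 10, "cid": 1, "status": "OPEN"},
    {"invid": 11, "cid": 1, "status": "PAID"},
    {"invid": 12, "cid": 2, "status": "HOLD"},
    {"invid": 13, "cid": 2, "status": "PAID"},
]

# Without pushdown: the cursor returns every invoice, the filter runs after the join.
wide = nested_loop_join(customers, list(table_cursor(invoices)), same_customer)
late = [(c, inv) for c, inv in wide if is_open_or_hold(inv)]

# With pushdown: the cursor evaluates the or-condition, the join sees fewer rows.
narrow = list(table_cursor(invoices, is_open_or_hold))
early = nested_loop_join(customers, narrow, same_customer)
```

Both variants return the same rows; the difference is the size of the intermediate result the join has to handle.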
Valkey-py is a Python library that allows you to connect to Valkey,
the open source (BSD) high-performance key/value datastore that
supports a variety of workloads such as caching and message queues,
and can act as a primary database.
Approved by: acm (mentor)
Libvalkey is the official C client for the Valkey database. It also
supports any server that uses the RESP protocol (version 2 or 3).
This project supports both standalone and cluster modes.
Approved by: acm (mentor)
DataLoader is a generic utility to be used as part of your application's data
fetching layer to provide a simplified and consistent API over various remote
data sources such as databases or web services via batching and caching.
A port of the "Loader" API originally developed by @schrockn at Facebook in 2010
as a simplifying force to coalesce the sundry key-value store back-end APIs
which existed at the time. At Facebook, "Loader" became one of the
implementation details of the "Ent" framework, a privacy-aware data entity
loading and caching layer within web server product code. This ultimately became
the underpinning for Facebook's GraphQL server implementation and type
definitions.
Asyncio DataLoader is a Python port of the original JavaScript DataLoader
implementation. DataLoader is often used when implementing a GraphQL service,
though it is also broadly useful in other situations.
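The batching and caching idea can be sketched in a few lines of plain asyncio. This is not the Asyncio DataLoader API: the TinyLoader class, its attributes, and the demo function are hypothetical and only illustrate the technique under simplifying assumptions (no maximum batch size, no cache eviction).

```python
import asyncio

class TinyLoader:
    """Hypothetical minimal loader: coalesces load() calls made in the same
    event-loop iteration into one batch call and caches results per key."""

    def __init__(self, batch_fn):
        self.batch_fn = batch_fn  # async callable: list of keys -> list of values
        self.cache = {}           # key -> Future (resolved or pending)
        self.queue = []           # keys waiting for the next batch

    def load(self, key):
        if key in self.cache:     # caching: repeated keys reuse the same Future
            return self.cache[key]
        loop = asyncio.get_running_loop()
        fut = loop.create_future()
        self.cache[key] = fut
        if not self.queue:        # first key of a new batch schedules the dispatch
            loop.call_soon(lambda: loop.create_task(self._dispatch()))
        self.queue.append(key)
        return fut

    async def _dispatch(self):
        keys, self.queue = self.queue, []
        try:
            values = await self.batch_fn(keys)
        except Exception as exc:
            # Do not cache failures; propagate the error to every waiter.
            for key in keys:
                self.cache.pop(key).set_exception(exc)
            return
        for key, value in zip(keys, values):
            self.cache[key].set_result(value)

async def demo():
    calls = []                    # records each batch sent to the backend

    async def batch_fn(keys):
        calls.append(list(keys))
        return [k * 10 for k in keys]

    loader = TinyLoader(batch_fn)
    first = await asyncio.gather(loader.load(1), loader.load(2), loader.load(1))
    second = await loader.load(2)  # served from the cache, no new batch
    return first, second, calls

first, second, calls = asyncio.run(demo())
```

In this sketch the three load calls result in a single backend call with keys 1 and 2, and the later load of key 2 is answered from the cache.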
lfcbase:
- BigDecimal::scaleTo: added RoundMode parameter which can either be
UP ( round up )
DOWN ( round down )
HALFUP ( round up if neighbors are equidistant or higher ) or
HALFDOWN ( round down if neighbors are equidistant or lower )
cego:
- Added support for lfcbase-1.21.8 ( BigDecimal::RoundMode ),
Round mode is set up in dbxml via the database ROUNDMODE attribute
( either UP, DOWN, HALFUP, HALFDOWN )
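The four round modes can be illustrated with Python's decimal module. This is a sketch, not the lfcbase implementation: the scale_to function is hypothetical, and the mapping of the mode names to Python's rounding constants is an assumption (for example, Python's ROUND_UP rounds away from zero, which may differ from lfcbase for negative values).

```python
from decimal import Decimal, ROUND_UP, ROUND_DOWN, ROUND_HALF_UP, ROUND_HALF_DOWN

# Assumed mapping from the changelog's mode names to Python's rounding constants.
ROUND_MODES = {
    "UP": ROUND_UP,
    "DOWN": ROUND_DOWN,
    "HALFUP": ROUND_HALF_UP,
    "HALFDOWN": ROUND_HALF_DOWN,
}

def scale_to(value, scale, mode):
    """Round a decimal string to `scale` fractional digits using the given mode."""
    quantum = Decimal(1).scaleb(-scale)  # scale=2 gives a quantum of 0.01
    return Decimal(value).quantize(quantum, rounding=ROUND_MODES[mode])

# 2.345 sits exactly between 2.34 and 2.35, so the half modes differ.
tie_results = {mode: scale_to("2.345", 2, mode) for mode in ROUND_MODES}
```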
--soname=libsqlite3.so.${DISTVERSION} means that users must
rebuild/reinstall all consumers of sqlite3 after each patch-level
update.
Use --soname=legacy => soname = libsqlite3.so.0 - ABI and API did not
change.
PR: 287983
Approved by: Pavel Volkov <pavelivolkov@gmail.com> (maintainer, implicit - fix runtime)
Fixes: 8af3fe5ef31b (update to 3.50.2)
Co-authored-by: russian@russerver.org
- Added performance patch to CegoSelect::buildJoinConditions. pPred
is set to checked if analyzed as complete, also if the predicate
is external ( set up via setViewCond ). By now, external
conditions are fully handled via CegoDistCursor
- In CegoFieldValue::toChain, for case DATETIME the resulting string
was corrected. By now, we avoid date value construction
via the date/scandate function; instead, the required value is
calculated via cast. So just the date string value is provided in
the defined datetime format.
- Added datetime cast operator to convert from string value to
datetime. If used, this avoids expensive real time cast operations
for queries and should reduce execution time
libprotobuf-c.so once again has version info. Force a rebuild of
its consumers.
PR: 282060
MFH: 2025Q3
sq is a command-line tool that provides jq-style access to structured
data sources including SQL databases and document formats. It supports
cross-source joins, multiple output formats, and unified querying across
various data sources.
WWW: https://sq.io/