About 50 results
Open links in new tab
  1. Unable to use dask-sql due to 'dask_expr.io' module

    Jul 7, 2025 · However, since dask 2025.1.0 release, dask-expr was merged in Dask. It is possible that latest versions of dask or dask-expr package are not well supported by dask-sql.

  2. How to transform Dask.DataFrame to pd.DataFrame?

    Aug 18, 2016 · How can I transform my resulting dask.DataFrame into pandas.DataFrame (let's say I am done with heavy lifting, and just want to apply sklearn to my aggregate result)?

  3. Reading an SQL query into a Dask DataFrame - Stack Overflow

    May 24, 2022 · I'm trying create a function that takes an SQL SELECT query as a parameter and use dask to read its results into a dask DataFrame using the dask.read_sql_query function.

  4. dask: looping over groupby groups efficiently - Stack Overflow

    Mar 25, 2025 · for name in set(ddf['groupby_column'].unique().compute()): group = ddf[ddf['groupby_column'].eq(name)].compute() # Process each group This approach computes the …

  5. python - Difference between dask.distributed LocalCluster with threads ...

    Sep 2, 2019 · What is the difference between the following LocalCluster configurations for dask.distributed? Client(n_workers=4, processes=False, threads_per_worker=1) versus …

  6. dask: difference between client.persist and client.compute

    Jan 23, 2017 · More pragmatically, I recommend using persist when your result is large and needs to be spread among many computers and using compute when your result is small and you want it on just …

  7. At what situation I can use Dask instead of Apache Spark?

    Dask.dataframe does not attempt to implement many pandas features or any of the more exotic data structures like NDFrames Thanks to the Dask developers. It seems like very promising technology. …

  8. Comparison between Modin | Dask | Data.table - Stack Overflow

    May 27, 2021 · dask was the first, has large eco-system and looks really well documented, discussed in forums and demonstrated on videos. modin (ray) has some design choices which allow it to be more …

  9. How to see progress of Dask compute task? - Stack Overflow

    I would like to see a progress bar on Jupyter notebook while I'm running a compute task using Dask, I'm counting all values of id column from a large csv file +4GB, so any ideas? import dask.datafr...

  10. python - Why does dask take long time to compute regardless of the …

    Mar 24, 2022 · The reason dask dataframe is taking more time to compute (shape or any operation) is because when a compute op is called, dask tries to perform operations from the creation of the …