All Versions
10
Latest Version
Avg Release Cycle
280 days
Latest Release
-

Changelog History

  • v4.0.0 Changes

    • ๐Ÿ’ฅ Breaking: Ruby 2.4 (EOL since 2020-03-31) is not officially supported anymore.
    • ๐Ÿ CI changes: moving from Travis CI (EOL) to GitHub actions. The Windows CI has been removed for now (see #97)
    • ๐Ÿ’ฅ Breaking: if your jobs use Kiba's "legacy runner" via config :kiba, runner: Kiba::Runner, be aware that this legacy runner has been removed in #96. The upgrade path is to remove this config line and let Kiba use the more modern Kiba::StreamingRunner, which is the default anyway since Kiba v3.0.0 (see #83 for context) and is normally fully backward-compatible.
    • ๐Ÿšš Cleanup: in Kiba v3, the kiba shell command had been deprecated and replaced by a simple stub printing a warning to STDERR. It is now removed for good.
    • ๐Ÿ‘• StandardRB has been added for formatting & linting the codebase.
  • v3.6.0 Changes

    • ๐Ÿ†• New: Kiba.run(job) can now (instead of a job parameter) take a block to define the job. See #94 for more details.
  • v3.5.0 Changes

    September 24, 2020

    ๐Ÿš€ This release adds support for Ruby 2.7 and Ruby 3.

    ๐Ÿ‘€ See #93 for detailed information and analysis.

  • v3.0.0 Changes

    February 10, 2020

    ๐Ÿ’ฅ BREAKING - the kiba CLI is deprecated

    ๐Ÿ—„ The kiba CLI is deprecated in favor of the more modern Kiba.parse programmatic API [#74, #81].

    ๐Ÿ‘ The programmatic API allows everything the "command" mode supported, plus much more, and actually encourage better coding practices. For instance:

    • ๐Ÿ”ง API mode allows to pass live variables (rather than just ENV configuration from command line or JSON configs from files)
    • ๐Ÿ‘ท Doing so permits to wrap resources open/close around running a job
    • โœ… API mode makes it easier to run testing on an ETL process (via minitest/rspec) directly in-process (which allows stubbing/webmock etc), rather than via a command call
    • API mode enforces use of clean modules with explicit loading, rather than polluting the top-level namespace with global methods
    • ๐Ÿ‘ท API mode allows to run jobs from Sidekiq or background job systems, from an HTTP call (if the job is fast), without necessarily waiting for a command line binary to run - this supports more dynamic interactions (e.g. a job is created in reaction to an external event received via HTTP or a websocket)

    ๐Ÿ“š A temporary kiba-legacy-cli gem is available (https://github.com/thbar/kiba-legacy-cli) to ease migration, but the recommendation is really to migrate over and use Kiba.parse directly, as described in the current documentation.

    0๏ธโƒฃ Kiba now defaults to StreamingRunner

    ๐Ÿš€ Introduced in v2.0.0 [#44] to ensure a transform could yield N rows for 1 input row, and improved in v2.5.0 [#57] to help implement "buffering transforms", the StreamingRunner is now made the default to process the jobs [#83].

    This change is expected to be backward compatible and will help with reusability & features of ETL components.

    ๐Ÿ’Ž Ruby compatibility notice

    • ๐Ÿ’Ž Kiba now officially supports MRI Ruby 2.4+ (although 2.3 will still work for now), JRuby 9.2+ or TruffleRuby.
    • ๐Ÿ‘€ You may get warnings with Ruby 2.7 and errors with Ruby 2.8+. See [#85] for status on Ruby 3 keyword arguments support.
  • v2.5.0 Changes

    May 29, 2019

    Aggregating / buffering transforms

    ๐Ÿš€ A Transform's close can now yield rows (this requires the new StreamingRunner, see v2.0.0 release notes).

    ๐Ÿ‘ This will let component implementers support new types of scenarios:

    • Batch transforms (such as the upcoming Kiba Pro ParallelTransform, or batch SQL lookups)
    • Grouping of rows (including in-memory or db-backed sort, normalisation operations, map operations)

    ๐Ÿ‘€ See #57 for more background & explanations.

    ๐Ÿ’Ž Ruby compatibility notice

    ๐Ÿ’Ž Kiba now requires MRI Ruby 2.3+, JRuby 9.1+ or TruffleRuby.

    โœ… This is done to reduce the testing burden, to encourage users to avoid EOL'ed rubies, and to let me use more recent Ruby features when relevant.

    Other tweaks

    • ๐Ÿ›  Fix incorrect error message when calling transform nil (#73 - thanks @envygeeks for the report).
    • ๐Ÿ›  Fix code & documentation links on Rubygems (#71 - thanks @janko).
  • v2.0.0 Changes

    January 05, 2018

    ๐Ÿ†• New StreamingRunner engine

    ๐Ÿ‘€ Kiba 2 introduces a new, opt-in engine called the StreamingRunner, which allows to generate an arbitrary number of rows inside class transforms. This drastically improves the reusability & composability of Kiba components (see #44 for some background).

    To use the StreamingRunner, use the following code:

    # activate the new Kiba internal config systemextend Kiba::DSLExtensions::Config# opt-in for the new engineconfig :kiba, runner: Kiba::StreamingRunner# write transform class able to yield an arbitrary number of rowsclass MyYieldingTransformdef process(row) yield {key: 1} yield {key: 2} {key: 3} endend
    

    ๐Ÿ’Ž The improved runner is compatible with Ruby 2.0+.

    ๐Ÿ— โš ๏ธ it is warmly recommended not to share data between the rows yielded this way, otherwise anything changing one row will also affect the others. Make sure to build completely independent rows (or use an immutable Hash structure).

    Compatibility with Kiba 1

    Kiba 2 is expected to be compatible with existing Kiba 1 scripts, as long as you did not use internal API.

    Internal changes include:

    • ๐Ÿ‘ท An opt-in, Elixir's mix-inspired config system, currently only used to select the runner you want at job declaration time
    • ๐Ÿ“œ A stronger isolation in the Parser, to reduces the chances that ETL scripts could conflict with Kiba internal classes
  • v1.0.0 Changes

    December 01, 2016
    • close becomes optional in destinations.
    • โฌ†๏ธ Bumping to 1.0.0 since Kiba is in wide production use.
  • v0.6.1 Changes

    June 28, 2015
    • ๐Ÿ›  Bugfix: make sure files generated by pre_process can safely be read from constructor of sources [#16]. Thanks @bcherrington for catching this.
    • ๐Ÿ”จ Internal refactoring of processing engine (should not affect regular use)
  • v0.6.0 Changes

    May 14, 2015
    • โž• Add pre_process block support
  • v0.5.0 Changes

    May 14, 2015
    • ๐ŸŽ‰ Initial release