ARROW-185: Make padding and alignment for all buffers be 64 bytes by emkornfield · Pull Request #74 · apache/arrow

emkornfield · 2016-05-11T09:39:43Z

Some small cleanup and removal of unnecessary code. There is likely a good opportunity to factor this code better in general, but this seems to work for now.

emkornfield · 2016-05-11T09:40:56Z

+  constexpr int64_t multiple_bitmask = round_to - 1;
+  int64_t remainder = num & multiple_bitmask;
+  int rounded = num;
+  if (remainder) { rounded += 64 - remainder; }


This should use round_to here instead of the hard-coded 64. I'm also pretty sure there is something clever we could do to avoid the conditional, but at the moment I'm blanking on it.

Does this do it?

(num + multiple_bitmask) & ~multiple_bitmask

That looks right to me, although the performance gain is probably moot given the other condition that checks for overflow.
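For reference, a minimal sketch of the branch-free rounding suggested above. The helper name `RoundUpToMultipleOf` is hypothetical and is not the function from this PR. The sketch assumes `round_to` is a power of two and that `num` is non-negative and small enough that the addition cannot overflow, so a separate overflow check is still needed.

```cpp
#include <cstdint>

// Hypothetical helper (not the PR's actual function): rounds `num` up to the
// next multiple of `round_to`. Assumes `round_to` is a power of two and that
// `num + round_to - 1` does not overflow int64_t.
constexpr int64_t RoundUpToMultipleOf(int64_t num, int64_t round_to) {
  // For a power of two, round_to - 1 is a mask of the low bits. Adding the
  // mask carries into the next multiple unless `num` is already a multiple,
  // and clearing the low bits then lands exactly on that multiple.
  return (num + (round_to - 1)) & ~(round_to - 1);
}

// Compile-time checks of the behaviour for a 64-byte multiple.
static_assert(RoundUpToMultipleOf(0, 64) == 0, "zero stays zero");
static_assert(RoundUpToMultipleOf(1, 64) == 64, "rounds up");
static_assert(RoundUpToMultipleOf(64, 64) == 64, "multiples are unchanged");
static_assert(RoundUpToMultipleOf(65, 64) == 128, "rounds up past a boundary");
```

Compared with the conditional version in the diff, this drops the branch and uses `round_to` throughout instead of a literal 64.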

emkornfield · 2016-05-11T16:24:58Z

Hmm, tests passed locally; I will need to take a closer look at what is going on.

wesm · 2016-05-12T02:28:34Z

  // An offset into data that is owned by another buffer, but we want to be
  // able to retain a valid pointer to it even after other shared_ptr's to the
  // parent buffer have been destroyed
+  // TODO(emkornfield) how will this play with 64 byte alignment/padding?


Inevitably, alignment and padding aren't always going to be guaranteed for in-memory data (of course, when data is moved for IPC purposes, they will need to be guaranteed). I suppose, then, that buffers will need to be able to communicate their alignment and padding for algorithm selection (i.e. can we use the spiffy AVX512 function or not?).

I think we need to see how use cases play out. Given the current spec, it seems most slicing operations in the general case will need memory allocation anyway. We could likely guarantee alignment and padding by providing a utility method that returns a slice when the slice can keep the contract and otherwise allocates new underlying data. For now I will put a warning here.
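To make the "communicate alignment/padding for algorithm selection" idea concrete, here is a rough sketch of the kind of check a caller might run before picking a SIMD code path. All names (`kAlignment`, `IsAligned`, `IsPadded`, `CanUseWideSimd`) are hypothetical and are not part of the Arrow API. The only fact taken from this PR is the 64-byte value.

```cpp
#include <cstdint>

// 64 bytes, per this PR's title. The constant name is hypothetical.
constexpr int64_t kAlignment = 64;

// True when the address of `data` is a multiple of 64.
inline bool IsAligned(const uint8_t* data) {
  return (reinterpret_cast<uintptr_t>(data) & (kAlignment - 1)) == 0;
}

// True when the allocated capacity is a multiple of 64, so a kernel may read
// whole 64-byte blocks without running past the allocation.
inline bool IsPadded(int64_t capacity) {
  return (capacity & (kAlignment - 1)) == 0;
}

// Hypothetical selection predicate: only take the wide SIMD path when both
// parts of the contract hold. Otherwise fall back to a scalar loop.
inline bool CanUseWideSimd(const uint8_t* data, int64_t capacity) {
  return IsAligned(data) && IsPadded(capacity);
}
```

A sliced buffer that starts at an arbitrary byte offset into its parent would typically fail `IsAligned`, which is the situation the TODO in the diff is worried about.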

emkornfield · 2016-05-13T08:02:11Z

Still need to address the other comments, but I pushed a commit that should allow the C++ tests to pass. I still need to check whether the Python tests are still failing.

emkornfield · 2016-05-17T16:47:11Z

Should be ready for review. Verification of alignment on RPC is not done here; I will open a JIRA to address this, if that is OK.

wesm · 2016-05-17T23:39:59Z

+  //
+  // This method makes no assertions about alignment or padding of the buffer but
+  // in general we expect buffers to be aligned and padded to 64 bytes.  In the future
+  // we might add utility methods to help determine if a buffer satisfies this contract.


Probably what we can do is add a method to produce a buffer that is guaranteed to be aligned and padded (allocating as necessary). For example: if there is incoming data from another library to libarrow that is not aligned or padded, some algorithms may work without alignment or padding, while others (e.g. requiring SIMD) would require the buffer to be "fixed". This could get pretty hairy, though...

I'm thinking about the case where an Arrow array is constructed with zero copy from memory allocated elsewhere.
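A rough sketch of what such a "fix the buffer if needed" method could look like. This is illustrative only: `AlignedView` and `EnsureAlignedAndPadded` are hypothetical names, not Arrow classes or functions. The sketch keeps the zero-copy path when the incoming memory already satisfies the 64-byte contract and copies into fresh, zero-padded storage otherwise.

```cpp
#include <cstddef>
#include <cstdint>
#include <cstring>
#include <memory>
#include <utility>

constexpr int64_t kAlignment = 64;  // from the PR title; the name is hypothetical

// Hypothetical result type: `data` is 64-byte aligned and `capacity` is a
// multiple of 64. `owned` is null when the input was reused without copying.
struct AlignedView {
  const uint8_t* data;
  int64_t capacity;
  std::unique_ptr<uint8_t[]> owned;
};

// Returns the input untouched if it already meets the contract. Otherwise it
// allocates, aligns, copies `size` bytes, and leaves the padding zeroed.
inline AlignedView EnsureAlignedAndPadded(const uint8_t* data, int64_t size,
                                          int64_t capacity) {
  const int64_t mask = kAlignment - 1;
  const bool aligned = (reinterpret_cast<uintptr_t>(data) & mask) == 0;
  const bool padded = capacity >= size && (capacity & mask) == 0;
  if (aligned && padded) return AlignedView{data, capacity, nullptr};

  const int64_t padded_size = (size + mask) & ~mask;
  // Over-allocate by 63 bytes so an aligned start always fits. The trailing
  // () value-initializes the array, so all bytes start as zero.
  std::unique_ptr<uint8_t[]> storage(
      new uint8_t[static_cast<size_t>(padded_size + mask)]());
  const uintptr_t base = reinterpret_cast<uintptr_t>(storage.get());
  const size_t shift = static_cast<size_t>((kAlignment - (base & mask)) & mask);
  uint8_t* start = storage.get() + shift;
  if (size > 0) std::memcpy(start, data, static_cast<size_t>(size));
  return AlignedView{start, padded_size, std::move(storage)};
}
```

Returning the owner alongside the pointer keeps the copy alive for as long as the view is held. Whether a fallback copy like this is acceptable, or whether algorithms should instead handle unaligned input directly, is exactly the open question raised above.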

wesm · 2016-05-17T23:42:03Z

LGTM. thank you for the thorough efforts on this. +1

ARROW-185: Make padding and alignment for all buffers be 64 bytes

6ff3048

emkornfield reviewed May 11, 2016

wesm reviewed May 12, 2016

add back in memsets because they make valgrind happy

05653cb

emkornfield changed the title ~~ARROW-185: Make padding and alignment for all buffers be 64 bytes~~ [WIP] ARROW-185: Make padding and alignment for all buffers be 64 bytes May 13, 2016

emkornfield added 5 commits May 16, 2016 08:56

replace cython string conversion with string builder

11b3fd7

cleanup

7543267

fix lint

c140e04

fix warning

1d006d8

fix cast style

e3cca14

emkornfield changed the title ~~[WIP] ARROW-185: Make padding and alignment for all buffers be 64 bytes~~ ARROW-185: Make padding and alignment for all buffers be 64 bytes May 17, 2016

wesm reviewed May 17, 2016

asfgit closed this in 9c59158 May 17, 2016

emkornfield deleted the emk_fix_allocations_PR branch February 26, 2021 05:14

paleolimbot mentioned this pull request Jan 28, 2023

[R] Crash on MacOS (x86) when running tests with homebrew apache-arrow also installed #33903

Closed

github-actions Bot mentioned this pull request Nov 26, 2024

GH-42156: [Java] Handle offset field from ArrowArray when BufferImportTypeVisitor imports offset buffer #43053

Closed

jonkeane mentioned this pull request May 11, 2025

[C++][R]: gcc-UBSAN errors on CRAN #46394

Closed

emkornfield wants to merge 7 commits into apache:master from emkornfield:emk_fix_allocations_PR
