
Add DigestInputStream utils #110

Merged · 2 commits merged into master on Feb 13, 2017
Conversation

pathikrit (Owner)

No description provided.

@pathikrit pathikrit merged commit 51c22e8 into master Feb 13, 2017
@pathikrit pathikrit deleted the cleanup-pr-109 branch February 13, 2017 20:05
@solicode (Contributor) left a comment

The API changes look fine to me. As for performance, it looks nearly identical, with the one exception of the buffer size. 1024 seems too conservative to me. I think you'll see a decent speed increase by picking a value like 4096 or 8192, since that aligns better with the block sizes of most systems. For example, BufferedInputStream.DEFAULT_BUFFER_SIZE is 8192. But if you'd rather tweak that value later, after a benchmark is added, that's fine too.
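The buffer-size point can be illustrated with a minimal JDK-only sketch (this is not better-files' actual implementation; `ChecksumSketch` and `checksum` are hypothetical names for illustration):

```java
import java.io.InputStream;
import java.io.ByteArrayInputStream;
import java.security.DigestInputStream;
import java.security.MessageDigest;

public class ChecksumSketch {
    // Reads the stream through a DigestInputStream using an 8192-byte
    // buffer (matching BufferedInputStream.DEFAULT_BUFFER_SIZE) instead
    // of 1024, so each read() call covers more of the underlying blocks.
    static String checksum(InputStream in, String algorithm) {
        try {
            MessageDigest md = MessageDigest.getInstance(algorithm);
            try (DigestInputStream dis = new DigestInputStream(in, md)) {
                byte[] buffer = new byte[8192];
                while (dis.read(buffer) != -1) {
                    // reading drives the digest; the bytes are otherwise discarded
                }
            }
            StringBuilder hex = new StringBuilder();
            for (byte b : md.digest()) {
                hex.append(String.format("%02x", b));
            }
            return hex.toString();
        } catch (Exception e) {
            throw new RuntimeException(e);
        }
    }
}
```

The digest result is identical regardless of buffer size; only the number of read calls (and thus throughput) changes.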

@@ -305,7 +302,6 @@ class FileSpec extends FlatSpec with BeforeAndAfterEach with Matchers {
}

it should "compute correct checksum for non-zero length string" in {
implicit val charset = StandardCharsets.UTF_8
@solicode (Contributor)

I added this because the default for writeText is Charset.defaultCharset(), which is system dependent. Otherwise, technically this test can fail on somebody else's machine if their default charset is something unusual.
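The underlying pitfall, in plain JDK terms: the same string encodes to different bytes (and therefore different checksums) under different charsets, and `Charset.defaultCharset()` varies by machine. A small sketch (`CharsetPin` and `encode` are hypothetical names):

```java
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;

public class CharsetPin {
    // Encodes text with an explicit charset rather than the platform
    // default, so the byte-level output is reproducible on any machine.
    static byte[] encode(String s, Charset cs) {
        return s.getBytes(cs);
    }
}
```

Pinning the charset (here via Scala's implicit `charset` in the test) is what makes the expected checksum deterministic.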

@pathikrit (Owner, Author)
@solicode : Thanks for the feedback. Fixed here: 7bd9e6d

@pathikrit (Owner, Author)
@solicode: May I ask, how are you benchmarking? Can you add it to this repo? I need some help benchmarking another change: #108

The above PR gets around a bug in the JDK by monkey-patching the Charset but I am worried about performance. Thanks!

@solicode (Contributor)
Unfortunately, I wasn't doing anything too sophisticated. Basically, I used very large files (and many of them) to ensure that essentially all of the processing time was being spent in digest, so that I could optimize that.

I was using already existing files on my system to test with, but I suppose we could just generate large files programmatically and test with that.

I haven't used JMH yet, but I'd like to get familiar with it. If you're not going to be working on #62 in the near future, I could take a look at it if you want. I wouldn't be able to do it right away, but maybe sometime this month.
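Generating large test files programmatically, as suggested above, can look something like this (an illustrative helper, not part of better-files; `BigFileGen` is a hypothetical name):

```java
import java.io.OutputStream;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.Random;

public class BigFileGen {
    // Fills a temp file with pseudo-random bytes up to the requested size.
    // A fixed seed keeps the content (and its digest) reproducible.
    static Path generate(long sizeBytes) {
        try {
            Path p = Files.createTempFile("bench", ".bin");
            byte[] chunk = new byte[1 << 16];
            Random rnd = new Random(42);
            try (OutputStream out = Files.newOutputStream(p)) {
                long written = 0;
                while (written < sizeBytes) {
                    rnd.nextBytes(chunk);
                    int n = (int) Math.min(chunk.length, sizeBytes - written);
                    out.write(chunk, 0, n);
                    written += n;
                }
            }
            return p;
        } catch (Exception e) {
            throw new RuntimeException(e);
        }
    }

    // Convenience wrapper so callers don't deal with the checked IOException.
    static long sizeOf(Path p) {
        try {
            return Files.size(p);
        } catch (Exception e) {
            throw new RuntimeException(e);
        }
    }
}
```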

@pathikrit (Owner, Author)
pathikrit commented Feb 14, 2017

@solicode : I don't have any plans for #62 anytime soon. So please feel free to contribute. Look at the benchmarks sub-project: https://github.com/pathikrit/better-files/tree/master/benchmarks/src/test/scala/better/files
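For contrast with what a proper harness like JMH (#62) provides, a naive timing loop looks like this; it illustrates why warmup matters but handles none of the JIT, dead-code-elimination, or statistical concerns JMH exists to solve (`NaiveBench` is a hypothetical name):

```java
public class NaiveBench {
    // Runs the task a few times to let the JIT warm up, then reports the
    // mean wall-clock time per iteration in nanoseconds. Illustrative only.
    static long timeNanos(Runnable task, int warmup, int iterations) {
        for (int i = 0; i < warmup; i++) task.run();
        long start = System.nanoTime();
        for (int i = 0; i < iterations; i++) task.run();
        return (System.nanoTime() - start) / iterations;
    }
}
```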
