
  • +1 for stating the bleeding obvious I should've seen when writing my answer. D'oh! :) Commented Jun 15, 2009 at 14:20
  • True; I was doing it without a temporary file, but it might be a little more efficient with one (no LinkedHashSet necessary). But I'd venture a guess that CPU isn't going to be the bottleneck anyway. Commented Jun 15, 2009 at 14:21
  • Er, my comment was directed at Workshop Alex, not gustafc. Commented Jun 15, 2009 at 14:22
  • 1
  • Of course, instead of using an output file, you could output to an unsorted string list, in memory. Then, when you're done adding the input without duplicates, write the string list over the old input file (see the sketch after these comments). It does mean you'll be using twice as much memory as with other solutions, but it's still extremely fast. Commented Jun 16, 2009 at 9:39
  • 1
    That's because it stores the strings twice: once in the hash table and once in the string list. (Then again, chances are that both the hashset and string list only store references to strings, in which case it won't eat up that much.) Commented Jun 16, 2009 at 21:15
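
The approach discussed in these comments can be collapsed into a single structure: a `LinkedHashSet` does the work of both the hash table and the string list at once, since it drops duplicates while preserving the order of first occurrence. Below is a minimal sketch, assuming the whole file fits in memory; the file name `input.txt` is a placeholder, not from the original discussion.

```java
import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.LinkedHashSet;
import java.util.Set;

public class DedupeLines {
    public static void main(String[] args) throws IOException {
        Path input = Path.of("input.txt"); // hypothetical file name

        // A LinkedHashSet discards duplicate lines while keeping
        // the original order of first occurrence.
        Set<String> unique = new LinkedHashSet<>(Files.readAllLines(input));

        // Overwrite the input file with the de-duplicated lines,
        // as the comments above suggest.
        Files.write(input, unique);
    }
}
```

As the last comment notes, both a `HashSet` and a string list would only hold references to the same `String` objects, so the separate-list variant does not truly double memory use; the single `LinkedHashSet` avoids even the second collection's entry overhead.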