Java – 对包含特殊字符的对象列表进行排序

huangapple go评论104阅读模式
英文:

Java - Sort a list of objects that includes a special character

问题

基本上,我有一个对象的ArrayList,如下所示:

  1. [{name: 'A'}, {name: 'B'}, {name:'?'}]

我希望对它们进行排序,以使问号像上面一样排在末尾。

但使用以下代码:

  1. Collections.sort(myList);

这总是导致带有问号的对象首先出现,我认为这是由于ASCII排序造成的?我认为正确的方法是使用比较器函数,但我不确定在字母和特殊字符方面该如何实现?

我应该如何实现这个?

英文:

Basically i have an arraylist of objects like so:

  1. [{name: 'A'}, {name: 'B'}, {name:'?'}]

I want to sort these so that the question mark is at the end like above.

But using the below code.

  1. Collections.sort(myList);

This always results in the object with the question mark first, I think this is due to ASCII ordering? I think the correct way forward is to use a comparator function but i'm not sure how that would take shape with letters and special characters?

How would I implement this?

答案1

得分: 2

一种替代方法是,可以使用基于规则的排序器将任何标点符号排序到末尾。

一个示例:

  1. List<String> words = Arrays.asList(
  2. "?dog", "rat", "456", "123", "dog", "pig", "?cat", "!rat", "cat"
  3. );
  4. String englishRules = ("< a,A < b,B < c,C < d,D < e,E < f,F "
  5. + "< g,G < h,H < i,I < j,J < k,K < l,L "
  6. + "< m,M < n,N < o,O < p,P < q,Q < r,R "
  7. + "< s,S < t,T < u,U < v,V < w,W < x,X "
  8. + "< y,Y < z,Z < 0,1,2,3,4,5,6,7,8,9");
  9. RuleBasedCollator rbc = new RuleBasedCollator(englishRules);
  10. rbc.setStrength(Collator.PRIMARY);
  11. Collections.sort(words, rbc);
  12. words.forEach((word) -> {
  13. out.print(word + " ");
  14. });

这将输出:

  1. cat dog pig rat 123 456 !rat ?cat ?dog

需要注意的点:

  1. 这个特定的示例仅限于英文排序。

  2. 这种通用技术之所以有效,是因为所有未提及的字符都被排序到末尾。因此,不仅标点符号在英文字母和数字之后排序,其他字符/符号(例如其他脚本使用的字符)也是如此。

  3. 如果你想要非Unicode标点符号的排序顺序,它们需要在规则字符串中用单引号括起来:

  1. "... < 0,1,2,3,4,5,6,7,8,9 < '?' < '!'"
英文:

An alternative approach - one which sorts any punctuation to the end - can be achieved using a rules-based collator.

An example:

  1. List&lt;String&gt; words = Arrays.asList(
  2. &quot;?dog&quot;, &quot;rat&quot;, &quot;456&quot;, &quot;123&quot;, &quot;dog&quot;, &quot;pig&quot;, &quot;?cat&quot;, &quot;!rat&quot;, &quot;cat&quot;
  3. );
  4. String englishRules = (&quot;&lt; a,A &lt; b,B &lt; c,C &lt; d,D &lt; e,E &lt; f,F &quot;
  5. + &quot;&lt; g,G &lt; h,H &lt; i,I &lt; j,J &lt; k,K &lt; l,L &quot;
  6. + &quot;&lt; m,M &lt; n,N &lt; o,O &lt; p,P &lt; q,Q &lt; r,R &quot;
  7. + &quot;&lt; s,S &lt; t,T &lt; u,U &lt; v,V &lt; w,W &lt; x,X &quot;
  8. + &quot;&lt; y,Y &lt; z,Z &lt; 0,1,2,3,4,5,6,7,8,9&quot;);
  9. RuleBasedCollator rbc = new RuleBasedCollator(englishRules);
  10. rbc.setStrength(Collator.PRIMARY);
  11. Collections.sort(words, rbc);
  12. words.forEach((word) -&gt; {
  13. out.print(word + &quot; &quot;);
  14. });

This outputs:

  1. cat dog pig rat 123 456 !rat ?cat ?dog

Points to note:

  1. This specific example is limited to an English collation.

  2. The general technique works because all non-mentioned characters are sorted to the end. So, not only are punctuation symbols sorted after English letters and digits - but so is every other character/symbol (e.g. those used by other scripts).

  3. If you want a non-Unicode ordering of punctuation symbols, they need to be surrounded by single quotes in your rule string:

  1. &quot;... &lt; 0,1,2,3,4,5,6,7,8,9 &lt; &#39;?&#39; &lt; &#39;!&#39;&quot;

答案2

得分: 1

在Java 8中,你可以使用两级自定义比较器:

  1. // 给定
  2. List<YourObject> list;
  3. list.sort((o1, o2) -> "?".equals(o1.getName()) ? 1 :
  4. ("?".equals(o2.getName()) ? -1 : o1.getName().compareTo(o2.getName())));

这里的排序逻辑是,如果其中一个或另一个名称为 ?,那么我们始终将该 ? 排在最后。如果两个名称都是 ?,或者两个名称都不是 ?,那么我们使用默认的字典序字符串排序。

英文:

In Java 8, you may use a two-tiered custom comparator:

  1. // given
  2. List&lt;YourObject&gt; list;
  3. list.sort((o1, o2) -&gt; &quot;?&quot;.equals(o1.getName()) ? 1 :
  4. (&quot;?&quot;.equals(o2.getName()) ? -1 : o1.getName().compareTo(o2.getName())));

The sorting logic here is that if one or the other names be ?, then we always sort that ? last. If both names be ?, or if neither be ?, then we sort using the default lexicographical string sorting.

答案3

得分: 1

以下是您要求的代码的中文翻译部分:

  1. 您可以使用`java.util.Comparator`中的一些有用方法,使您的生活变得更加轻松,您的代码也不再容易出错,而不需要考虑if-else或三元运算符:
  2. class MyObj {
  3. private String name;
  4. MyObj(String name) {
  5. this.name = name;
  6. }
  7. String getName() {
  8. return name;
  9. }
  10. @Override
  11. public String toString() {
  12. return "name: '" + name + "'";
  13. }
  14. }
  15. public class Demo {
  16. public static void main(String[] args) {
  17. List<MyObj> lst = Arrays.asList(new MyObj("B"), new MyObj("?"), new MyObj("A"));
  18. Comparator<String> questionMarksLast = Comparator
  19. .<String, Boolean>comparing("?".equals)
  20. .thenComparing(Comparator.naturalOrder());
  21. lst.sort(Comparator.comparing(MyObj::getName, questionMarksLast));
  22. System.out.println(lst); // 输出 [{name: 'A'}, {name: 'B'}, {name: '?'}]
  23. }
  24. }
英文:

You can use some useful methods from java.util.Comparator to make your life easier and your code less error-prone than having to think about if-else or ternary operators:

  1. class MyObj {
  2. private String name;
  3. MyObj(String name) {
  4. this.name = name;
  5. }
  6. String getName() {
  7. return name;
  8. }
  9. @Override
  10. public String toString() {
  11. return &quot;{name: &#39;&quot; + name + &quot;&#39;}&quot;;
  12. }
  13. }
  14. public class Demo {
  15. public static void main(String[] args) {
  16. List&lt;MyObj&gt; lst = Arrays.asList(new MyObj(&quot;B&quot;), new MyObj(&quot;?&quot;), new MyObj(&quot;A&quot;));
  17. Comparator&lt;String&gt; questionMarksLast = Comparator
  18. .&lt;String, Boolean&gt;comparing(&quot;?&quot;::equals)
  19. .thenComparing(Comparator.naturalOrder());
  20. lst.sort(Comparator.comparing(MyObj::getName, questionMarksLast));
  21. System.out.println(lst); // prints [{name: &#39;A&#39;}, {name: &#39;B&#39;}, {name: &#39;?&#39;}]
  22. }
  23. }

huangapple
  • 本文由 发表于 2020年4月3日 23:57:08
  • 转载请务必保留本文链接:https://go.coder-hub.com/61015766.html
匿名

发表评论

匿名网友

:?: :razz: :sad: :evil: :!: :smile: :oops: :grin: :eek: :shock: :???: :cool: :lol: :mad: :twisted: :roll: :wink: :idea: :arrow: :neutral: :cry: :mrgreen:

确定