Data Structures and Algorithms: Finding the Top 10 Largest Numbers Among One Million Numbers

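1. Partitioning, based on the idea behind quicksort. The algorithm is as follows:
(1) Partition the data around a pivot b into two intervals, [a,b) and (b,d], so that every number in (b,d] is no smaller than any number in [a,b).
(2) Repeat (1) on (b,d] until the rightmost interval holds fewer than 100 numbers. Note that [a,b) does not need to be partitioned at this stage.
(3) Go back to the previous interval and record how many numbers the right interval holds. Then partition the left part of that previous interval into [a2,b2) and (b2,d2], and take (b2,d2]. If the total is still fewer than 100 numbers, continue with (3); if it exceeds 100, repeat (1), until the right side holds exactly 100 numbers.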
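The partitioning idea above can be sketched as an iterative quickselect. This is a minimal illustration, not necessarily the exact procedure described: instead of descending into the right interval and backtracking, it narrows directly onto the boundary index n - k. The class name QuickSelectTopK, the method names, and the random pivot choice are assumptions made for this example.

```java
import java.util.Arrays;
import java.util.Random;

class QuickSelectTopK {
    // Returns the k largest values of data, in no particular order.
    // Works on a copy, so the caller's array is not reordered.
    static int[] topK(int[] data, int k) {
        if (k <= 0) return new int[0];
        if (k >= data.length) return data.clone();
        int[] a = data.clone();
        int target = a.length - k;   // index where the top-k region must start
        int lo = 0, hi = a.length - 1;
        Random rnd = new Random();
        while (lo < hi) {
            int p = partition(a, lo, hi, lo + rnd.nextInt(hi - lo + 1));
            if (p == target) break;
            if (p < target) lo = p + 1;  // boundary lies in the right interval
            else hi = p - 1;             // boundary lies in the left interval
        }
        return Arrays.copyOfRange(a, target, a.length);
    }

    // Lomuto partition: values smaller than the pivot end up on its left,
    // values greater than or equal to it on its right. Returns the pivot's final index.
    static int partition(int[] a, int lo, int hi, int pivotIndex) {
        int pivot = a[pivotIndex];
        swap(a, pivotIndex, hi);
        int store = lo;
        for (int i = lo; i < hi; i++) {
            if (a[i] < pivot) swap(a, i, store++);
        }
        swap(a, store, hi);
        return store;
    }

    static void swap(int[] a, int i, int j) {
        int t = a[i];
        a[i] = a[j];
        a[j] = t;
    }

    public static void main(String[] args) {
        int[] top = topK(new int[] {5, 1, 9, 3, 7, 9, 2}, 3);
        Arrays.sort(top);
        System.out.println(Arrays.toString(top));
    }
}
```

The expected running time is O(N), but the whole data set has to fit in memory and the working copy is reordered, which is the main trade-off against the heap-based method.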

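2. Take the first 100 numbers and maintain a min-heap of 100 numbers, then scan the remaining elements once, maintaining the heap along the way. The steps are:
step1: Take the first m elements (for example m=100) and build a min-heap. One operation that restores the min-heap property takes O(lg m), so building the heap by m insertions takes m*O(lg m)=O(m lg m).
step2: Read the remaining elements in order until the input ends. For each element, if it is not larger than the heap top, discard it;
if it is larger than the heap top, replace the heap top with it and restore the min-heap property. In the worst case every element replaces the heap's minimum, so maintaining the heap costs (N-m)*O(lg m).
At the end, the elements in the heap are the m largest. The overall time complexity is O(N lg m).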

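3. Block-based search

First split the 1,000,000 numbers into 100 blocks of 10,000 numbers each. Find the largest number in each block, then compare: among these 100 maxima find the largest and the smallest, take the second-largest number from the block that holds the largest, compare it with the smallest, and so on.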
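The description above trails off, so the following is only one possible way to complete it, under stated assumptions: each block is sorted, a max-heap holds the current largest unused value of every block, and each time a value is taken the next-largest value from the same block replaces it. The class name BlockTopK, the blockSize parameter, and the choice to sort each block are all assumptions made for this sketch.

```java
import java.util.Arrays;
import java.util.PriorityQueue;

class BlockTopK {
    // Returns the k largest values of data in descending order,
    // splitting the input into blocks of blockSize elements.
    static int[] topK(int[] data, int k, int blockSize) {
        if (blockSize <= 0) throw new IllegalArgumentException("blockSize must be positive");
        if (k <= 0 || data.length == 0) return new int[0];
        int blockCount = (data.length + blockSize - 1) / blockSize;
        int[][] blocks = new int[blockCount][];
        // Max-heap of {value, block index, position in block}, largest value first
        PriorityQueue<int[]> heads =
                new PriorityQueue<>((x, y) -> Integer.compare(y[0], x[0]));
        for (int b = 0; b < blockCount; b++) {
            int start = b * blockSize;
            int end = Math.min(start + blockSize, data.length);
            blocks[b] = Arrays.copyOfRange(data, start, end);
            Arrays.sort(blocks[b]);              // ascending, so the maximum is last
            int last = blocks[b].length - 1;
            heads.add(new int[] {blocks[b][last], b, last});
        }
        int n = Math.min(k, data.length);
        int[] result = new int[n];
        for (int i = 0; i < n; i++) {
            int[] top = heads.poll();            // largest remaining value overall
            result[i] = top[0];
            int b = top[1];
            int next = top[2] - 1;               // next-largest value of the same block
            if (next >= 0) heads.add(new int[] {blocks[b][next], b, next});
        }
        return result;
    }

    public static void main(String[] args) {
        int[] top = topK(new int[] {5, 1, 9, 3, 7, 9, 2, 8}, 3, 3);
        System.out.println(Arrays.toString(top));
    }
}
```

Sorting every block costs more than a single heap pass over the data, so this variant is mainly of interest when the blocks are processed separately, for example on different machines or from different files.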

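The following Java program implements method 2 (the min-heap approach), reading the numbers from a file:

package BigData;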
  
import java.io.*;  
import java.util.PriorityQueue;  
import java.util.Queue;  
  
public class FileTest {  
    public FileTest() {  
    }  
  
    public static void main(String[] args) {  
        // madeData();  
        sortData();  
    }  
  
    // Reads one integer per line and keeps the 100 largest values in a min-heap.
    private static void sortData() {
        // PriorityQueue is a min-heap by default, so peek() is the smallest value kept so far
        Queue<Integer> priorityQueue = new PriorityQueue<Integer>(100);
        // try-with-resources closes the reader even if reading or parsing fails
        try (BufferedReader br = new BufferedReader(new FileReader("F:/add3.txt"))) {
            String str;
            long begin3 = System.currentTimeMillis();
            while ((str = br.readLine()) != null) {
                int temp = Integer.parseInt(str.trim());
                if (priorityQueue.size() < 100) {
                    priorityQueue.add(temp);
                } else if (temp > priorityQueue.peek()) {
                    // Replace the current minimum only when the new value is larger
                    priorityQueue.poll();
                    priorityQueue.offer(temp);
                }
            }
            long end3 = System.currentTimeMillis();
            System.out.println("FileReader elapsed time: " + (end3 - begin3) + " ms");
            System.out.println("Top 100 numbers (heap order): " + priorityQueue.toString());
        } catch (Exception e) {
            e.printStackTrace();
        }
    }
  
    // Writes `count` random integers in [0, 1000000), one per line, as test data.
    private static void madeData() {
        int count = 1000000; // number of lines to write
        // try-with-resources flushes and closes the writer even if a write fails
        try (FileWriter fw = new FileWriter("F:/add3.txt")) {
            long begin3 = System.currentTimeMillis();
            for (int i = 0; i < count; i++) {
                int temp = (int) (Math.random() * 1000000);
                fw.write(temp + "\r\n");
            }
            long end3 = System.currentTimeMillis();
            System.out.println("FileWriter elapsed time: " + (end3 - begin3) + " ms");
        } catch (Exception e) {
            e.printStackTrace();
        }
    }
}