200 likes | 304 Views
Fu-Hsiang’s research log. 2004/1~2004/6. Half-year plan. 2004/1/5~2004/1/11. Works done last week Re-test for external and internal testing with 5KB white page Process pornography homepage text Works planned for this week Process pornography homepage text and try to training these data
E N D
Fu-Hsiang’s research log 2004/1~2004/6
2004/1/5~2004/1/11 • Works done last week • Re-test for external and internal testing with 5KB white page • Process pornography homepage text • Works planned for this week • Process pornography homepage text and try to training these data • Modify external and internal testing results • Notes Last week I retested for external and internal testing with 5KB web page, but there had some mistakes in the results. Therefore, I will modify the external results slides and analysis the internal results to make new slides.
2004/1/12~2004/1/16 • Works done last week • Process pornography webpage text and try to training these data • Modify external and internal testing results • Works planned for this week • Select and put the keywords that N-gram training into the DansGuardian • Modify slides • Notes Last week I processed the webpage text and trained the webpage text by N-gram tool. Therefore, I got many keywords for 2-gram, 3-gram, 4-gram, 5-gram and 6-gram. I will select the keywords and put into the DG this week.
2004/2/2~2004/2/6 • Works planned for this week • Prepare the 預官 exam • Select and add the Chinese keywords into DansGuardian • Notes I will select the Chinese keywords and put into the DansGuardian this week.
2004/2/6~2004/2/13 • Works done last week • 預官 exam • Sieved out the proper Chinese keywords from 2~6gram • Works planned for this week • Finish sieve out the Chinese keywords from training data • Add the keywords with scores into DansGuardian • Notes Last week I sieved out some of the proper Chinese keywords from 2gam to 6gram. However, the keywords is too much. I’ll continue sieving out the Chinese keywords from training data and add them into DansGuardian with scores this week.
2004/2/16~2004/2/20 • Works done last week • Finish sieve out the Chinese keywords from training data • Add the keywords with scores into the DansGuardian • Benchmark the DansGuardian with Chinese keywords • Works planned for this week • Adjust the Chinese keywords’ scores in the DansGuardian • Implement the Early blocking & Early bypassing algorithm into the DansGuardian • Notes Last week, I have completed the work of sieve the Chinese keywords and added them into DansGuardian and did the accurate benchmark. I will adjust the scores of the Chinese keywords in the DansGuardian and implement the Early blocking and Early bypassing algorithm into the DansGuardian this week.
2004/2/23~2004/2/27 • Works done last week • Adjust the scores of Chinese keyword in the DansGuardian • To Analyze the program of DansGuardian • Works planned for this week • Benchmark the DansGuardian accuracy after adjusting the scores • Implement the Early blocking & Early bypassing algorithm into the DansGuardian • Notes Last week, I spent much time on understanding of the code of DansGuardian and spent less time on adjusted the scores of the keywords. This week, I will try to add codes into the DansGuardian and benchmark the DansGuardian accuracy again. Furthermore, I will spend two week to implement the Early blocking & Early bypassing algorithm into the DansGuardian and spend two week to put codes into the webfd.
2004/3/1~2004/3/5 • Works done last week • Benchmark the DansGuardian accuracy after adjusting the scores • Implement the Early blocking & Early bypassing algorithm into the DansGuardian • Works planned for this week • Implement the Early blocking & Early bypassing algorithm into the DansGuardian • Benchmark the DansGuardian with Early blocking & Early bypassing • Notes Last week, I did DansGuardian accuracy benchmark with new scores and modified the DansGuardian but still had many problems. This week, I will try to solve these problems (separated compute scores, blocking threshold, bypassing threshold..).
2004/3/8~2004/3/12 • Works done last week • Finish implement the Early blocking & Early bypassing algorithm into the DansGuardian • Benchmark the DansGuardian latency with Early blocking & Early bypassing • Works planned for this week • Re-benchmark the latency of DansGuardian with Early blocking & Early bypassing • Thesis outline • Notes Last week, I put the Early blocking & Early bypassing codes into the DansGuardian successfully and did the latency benchmark, but there were some problems in the results. This week, I will re-test the latency of DansGuardian with Early blocking & Early bypassing algorithm by putting code into the DansGuardian. Furthermore, I must prepare the thesis outline before Friday.
2004/3/15~2004/3/19 • Works done last week • Thesis outline • Re-benchmark the latency of the DG with Early blocking & Early bypassing • Works planned for this week • Implement the modified Early blocking & Early bypassing algorithm into the DG • Try to use avalanche smartbits to do benchmark of the DG • Chapter 1 of the Thesis • Notes Last week, I re-benchmark the latency of the DG with Early blocking & Early bypassing algorithm, but the throughput was too bad. Maybe I’ll get better results by using avalanche smartbits. However, using avalanche with smartbits at 604 is a troublesome business. This week, I’ll keep coding the modified Early blocking & Early bypassing algorithm into the DG and try to use free time to using avalanche smartbits to test the DG . Furthermore, I will spend much time in writing chapter one of the thesis.
2004/3/22~2004/3/26 • Works done last week • Implement modified Early blocking & Early bypassing algorithm into the DG • Benchmark the latency of the DG • Finish Chapter 1 of the thesis • Works planned for this week • Chapter 2 of the Thesis • To analyze the code of the webfd • Notes Last week, I implemented the modified Early blocking and Early bypassing algorithm into the DG and finish writing chapter 1 of thesis. This week, I will spend time in writing chapter 2 of the thesis and to analyze the code of webfd.
2004/3/29~2004/4/2 • Works done last week • To analyze the code of the webfd • Works planned for this week • Chapter 2 of the Thesis • To analyze the code of the webfd • Notes Last week, I analyzed the codes of the webfd and found out a problem in the architecture. This week, I must spend much time in writing chapter 2 of the thesis and use remainder time to analyze the code of webfd.
2004/4/5~2004/4/9 • Works done last week • To analyze the code of the webfd and DG • Works planned for this week • Chapter 3 of the Thesis • Modify the code of webfd • Notes Last week, I analyzed the codes of the webfd and modified some codes. This week, I will try to finish modifying the code of webfd and writing chapter 3 of the thesis.
2004/4/12~2004/4/16 • Works done last week • To analyze the code of the webfd and DG • Finish chapter 3 of the thesis • Works planned for this week • Chapter 4 of the thesis • Survey issues and solutions of the thesis • Notes Last week, I wrote the chapter 3 of the thesis and had some problem. This week, I will spend more time to survey the algorithms or solutions and write the chapter 4.
2004/4/19~2004/4/23 • Works done last week • Chapter 4 of thesis • Works planned for this week • Modify Chapter 1~4 of thesis • Chapter 5 of thesis • Notes Last week, I wrote the chapter 4 of the thesis and discussed with Po-Ching. This week, I will spend time to modify chapter 1 to 4 of thesis and write chapter 5 of thesis.
2004/4/26~2004/4/30 • Works done last week • Modified chapter 1~4 of thesis • Chapter 5 of thesis • Works planned for this week • Modify Chapter 1~5 of thesis • Modify thesis slides • Notes Last week, I wrote the chapter 5 of thesis and modified the code of webfd. This week, I will spend time to modify chapter 1 to 5 of thesis and slides.
2004/5/3~2004/5/7 • Works done last week • Modified Chapter 1~5 of thesis • Modified slides of thesis • Works planned for this week • Modify Chapter 1~5 of thesis • Modify code of webfd • Notes Last week, I modified slides and chapter 1 to 5 of thesis. This week, I will spend time to modify the codes of webfd.
2004/5/24~2004/5/28 • Works done last week • Benchmarking of webfd / DG • Works planned for this week • Benchmarking of webfd / DG with snort • Notes Last week, I did throughput benchmarking of webfd and DansGuardian. The throughput of webfd with all checking (URL and content keyword) is 12Mbps but the value is not highest point. Because we don’t have enough computers and we just use 8 computers to do the benchmarking. This week, I will continue to do benchmarking of webfd / DG with snort.