Thread: In reference to gsoc
Hello !
I am a sophomore pursuing Computer Engineering in NSIT affiliated with Delhi University. I came across your organisation through gsoc portal and am very interested to work on project PostgreSQL sorting algorithm benchmark and implementation.
My skill set include android, python , java , c++/c, git/github. I have good knowledge of data structure and algorithms. I am an hard working person and am willing to learn and research on new algorithms and data structures need for this particular project. I believe that this project fits my skill set.
It would be helpful if someone direct me to the repo I will be working and sorting techniques used. Please give me a brief about the data structures used and also the starting point for getting familiar repository and work that needs to be done in this project.
Awaiting your reply :)
Regards
Tanvi
On Thu, Mar 1, 2018 at 7:50 AM, Tanvi Dadu <tanvidadu1998@gmail.com> wrote: > It would be helpful if someone direct me to the repo I will be working and > sorting techniques used. Please give me a brief about the data structures > used and also the starting point for getting familiar repository and work > that needs to be done in this project. You're probably not going to have a ton of luck working the PostgreSQL community unless you can figure out things like this without help from other people. Most people here are quite busy and don't have time to help you figure out things that you should be able to figure out yourself with 10 minutes and a search engine, and the location of the source code is one of those things. But see: https://www.postgresql.org/docs/current/static/sourcerepo.html https://www.postgresql.org/docs/current/static/git.html I would start by reading the code and comments in src/backend/utils/sort/tuplesort.c See also https://wiki.postgresql.org/wiki/Development_information I don't really know what work needs to be done in terms of sorting at this point. Quite a few improvements have been made over the last few years and I'm not sure exactly what could profitably be done next. Andres Freund mentioned to me yesterday that there might be some work around sorting + JIT once he gets the basic JIT stuff committed; and it would certainly be cool to be able to use the parallel sort stuff we added in commit 9da0cc35284bdbe8d442d732963303ff0e0a40bc for parallel query as well as for parallel index builds. However, I don't have an exact description of what work might need to be done there in either case, and neither constitutes an improvement to the sorting code per se. -- Robert Haas EnterpriseDB: http://www.enterprisedb.com The Enterprise PostgreSQL Company