How Do You Optimize Python Code For Competitive Programming?

Asked 7 months ago
Answer 1
Viewed 276
0

Python is a popular programming language, and many developers use HackerEarth to learn more about it. As a result, the majority of contributions to the challenges presented on our site are in Python.

We all know how we pass the first test case, but the other test cases fail due to memory or time constraints. So, let's have a look at some of the recommendations that should be kept in mind to ensure that a proper Python programme stays inside the platform's restrictions throughout the problems.

1. Get the whole setup ready before-hand

This is common sense. Get the whole setup ready. Launch your Python editor beforehand. If you are writing your files locally, establish and activate a virtual environment. In addition, I would recommend something that may seem contentious and counter-intuitive: adopt TDD. Use your preferred testing tool. I usually use pytest and have "pip" installed in my virtual environment before beginning to write little test scripts. I've discovered that testing improves clarity of mind, which helps me write quicker programmes. This also helps in the restructuring of code to improve its speed. We'll get to that later.

2. Get the code working first

People have their own coding techniques. Use the coding style that makes you most comfortable. For the first iteration, make the code work and submit it. See whether it passes all of the test scenarios. If it's passing, that's it. It's finished. And you may on to the next question.

If certain test cases pass but others fail due to memory concerns, it indicates that there is still work to be done.

3. Python Coding Tips

We can use some pythonic code constructs that give us better performance. Let’s look at them below.

Strings:

Do not use the below construct.

s = ""
for x in somelist:
   s += some_function(x)

Instead use this

slist = [some_function(el) for el in somelist]
s = "".join(slist)

This is because in Python, str is immutable, so the left and right strings have to be copied into the new string for every pair of concatenation.

Language Constructs:

Functions: Make functions of your code and although procedural code is supported in Python, it’s better to write them in functions. You might have heard this advice from proponents of code reusability and champions of TDD (Note: I am one of them), but here I would say that functions are better because they are faster. So

def main():
    for i in xrange(10**8):
        pass

main()

is better than

for i in xrange(10**8):
    pass

This is because of the underlying CPython implementation. In short, it is faster to store local variables than globals. Read more in this SO post.

I would recommend you keep your procedural code as little as possible. You can use the following standard template.

def solution(args):
    # write the code
    pass

def main():
    # write the input logic to take the input from STDIN
    Input_args = ""
    solution(input_args)

If __name__ == "__main__":
    main()

Use the standard library:

Use built-in functions and the standard library as much as possible. Therefore, instead of this:

newlist = []
for item in oldlist:
    newlist.append(myfunc(item))

Use this:

newlist = map(myfunc, oldlist)

There is also the list expressions or the generator expressions.

newlist = [myfunc(item) for item in oldlist] # list expression
newlist = (myfunc(item) for item in oldlist) # generator expression

Similarly, use the standard library, like itertools, as they are generally faster and optimized for common operations. So you can have something like permutation for a loop in just three lines of code.

>>> import itertools
>>> iter = itertools.permutations([1,2,3])
>>> list(iter)
[(1, 2, 3), (1, 3, 2), (2, 1, 3), (2, 3, 1), (3, 1, 2), (3, 2, 1)]

But if the containers are tiny, then any difference between code using libraries is likely to be minimal, and the cost of creating the containers will outweigh the gains they give.

Generators:

A Python generator is a function which returns a generator iterator (just an object we can iterate over) by calling yield. When a generator function calls yield, the “state” of the generator function is frozen; the values of all variables are saved and the next line of code to be executed is recorded until next() is called again. Generators are excellent constructs to reduce both the average time complexity as well as the memory footprint of the code that you have written. Just look at the following code for prime numbers.

def fib():
    a, b = 0, 1
    while 1:
        yield a
        a, b = b, a + b

So, this will keep on generating Fibonacci numbers infinitely without the need to keep all the numbers in a list or any other construct. Please keep in mind that you should use this construct only when you don’t have any absolute need to keep all the generated values because then you will lose the advantage of having a generator construct.

4. Algorithms and Data structures

To make your code run faster, the most important thing that you can do is to take two minutes before writing any code and think about the data-structure that you are going to use. Look at the time complexity for the basic python data-structures and use them based on the operation that is most used in your code. The time complexity of the list taken from python wiki is shown below.

Similarly, keep on reading from all sources about the most efficient data structures and algorithms that you can use. Keep an inventory of the common data structures such as nodes and graphs and remember or keep a handy journal on the situations where they are most appropriate.

Writing fast code is a habit and a skill, which needs to be honed over the years. There are no shortcuts. So do your best and best of luck.

Read Also : Is there anything inappropriate in Avatar The Last Airbender?
Answered 7 months ago Pirkko  KoskitaloPirkko Koskitalo